FIGURE l 

GGGGCTTCGGCGCCAGCGGCCAGCGCTAGTCGGTCTGGTAAGGATTTACAAAAGGTGCAGGTATG 
AGCAGGTC TGAAGACTAACAT T T TGT GAAG T TG TAAAACAGAAAACCT G T TAGAA ATG T GGT GGT 
TTCAGCAAGGCCTCAGTTTCCTTCCTTCAGCCCTTGTAATTTGGACATCTGCTGCTTTCATATTT 
TCAT ACAT TACT GCAGTAACACT CCACCATATAGACCCGGC T TTACCT TATATCAG TGACACTGG 
TACAGTAGCTCCAGAAAAATGCTTATTTGGGGCAATGCTAAATATTGCGGCAGTTTTATGCATTG 
CTACCAT T TAT GT TCGT TAT AAGCAAGT TCAT GC T CT GAGTCCT GAAGAGAACGT TAT CATCAAA 
P k TTAAACAAGGCTGGCCTTGTACTTGGAATACTGAGTTGTTTAGGACTTTCTATTGTGGCAAACTT 
3; CCAGAAAACAACCCTTTTTGCTGCACATGTAAGTGGAGCTGTGCTTACCTTTGGTATGGGCTCAT 
m TATATATGTTTGTTCAGACCATCCTTTCCTACCAAATGCAGCCCAAAATCCATGGCAAACAAGTC 
||| TTCTGGATCAGACTGTTGTTGGTTATCTGGTGTGGAGTAAGTGCACTTAGCATGCTGACTTGCTC 
III ATCAGT TTTGCACAGTGGCAATT TTGGGACTGATT TAGAACAGAAACTCCATTGGAACCCCGAGG 

^ ACAAAGGT TATGT GCT TCACATGAT CAC TAC TGCAGCAGAAT GG T C TATGTCATT T TCC T TCT T T 

GGTTTTTTCCTGACTTACATTCGTGATTTTCAGAAAATTTCTTTACGGGTGGAAGCCAATTTACA 
fi TGGATTAACCCTCTATGACACTGCACCTTGCCCTATTAACAATGAACGAACACGGCTACTTTCCA 
111 GAGAT AT TTGAT GAAAGGATAAAATATT TC T GT AATGAT TATGAT T CT CAGGGAT TGGGGAAAGG 

(3 TTCACAGAAGTTGCTTATTCTTCTCTGAAATTTTCAACCACTTAATCAAGGCTGACAGTAACACT 
^ GAT GAATGCTGAT AATCAGGAAACAT GAAAGAAGC CAT T TGATAGAT TATTC TAAAGGATATCAT 

U C AAG AAGAC TAT T AAAAACAC C TAT G C C TAT AC t T T T T TAT C T CAGAAAAT AAAG T C AAAAGAC T 

III 

ATG 
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FIGURE 2 

<subunit 1 of 1, 266 aa, 1 stop 
<MW: 29766, pi: 8.39, NX(S/T): 0 

MWWFQQGLSFLPSALVIWTSAAFIFSYITAVTLHHIDPALPYISDTGTVAPEKCLFGAMLNIAAV 
LC I AT I YVRYKQVHAL S PEENVI I KLNKAGLVLG I LS CLGL S I VANFQKT TL FAAHVS GAVL T FG 
MGSLYMFVQTILSYQMQPKIHGKQVFWIRLLLVIWGGVSALSMLTCSSVLHSGNFGTDLEQKLHW 
NPEDKGYVLHMI T TAAE WSMS FS FFG FFLT Y IRDFQK I S LRVEANLHGL TL YDTAPCP INNERTR 
LLSRDI 

Important features: 

Type II transmembrane domain: 

amino acids 13-33 

Other Transmembrane domains: 

amino acids 54-73, 94-113, 160-180, 122-141 

N-myristoylation sites. 

amino acids 57-63, 95-101, 99-105, 124-130, 183-189 
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FIGURE a 

CGGACGCGTGGGCGGACGCGTGGGGGAGAGCCGCAGTCCCGGCTGCAGCACCTGGGAGAAGGCAGACC 
GTGTGAGGGGGCCTGTGGCCCCAGCGTGCTGTGGCCTCGGGGAGTGGGAAGTGGAGGCAGGAGCCTTC 
CTTACACTTCGCC ATGA GTT TCCT CATC GACT CCAGCAT CATGATTACCTCCCAGATAC TATT T TTT G 
GATTTGGGTGGCTTTTCTTCATGCGCCAATTGTTTAAAGACTATGAGATACGTCAGTATGTTGTACAG 
GTGATCTTCTCCGTGACGTTTGCATTTTCTTGCACCATGTTTGAGCTCATCATCTTTGAAATCTTAGG 
AGTATTGAATAGCAGCTCCCGTTATTTTCACTGGAAAATGAACCTGTGTGTAATTCTGCTGATCCTGG 
|^ T TTT CATGGTGCCTT TT TACATTGGCTATT T TATT GT GAGCAATATCCGACTACTGCATAAACAACGA 

p CTGCTTTTTTCCTGTCTCTTATGGCTGACCTTTATGTATTTCTTCTGGAAACTAGGAGATCCCTTTCC 
Cl CATT CTCAGCCCAAAACATGGGAT CTT ATCCAT AGAACAGCTCAT CAGCCGGGTTGGTGT GAT TGGAG 

TGACTCTCATGGCTCTTCTTTCTGGATTTGGTGCTGTCAACTGCCCATACACTTACATGTCTTACTTC 
\\* C TCAGGAATGT GACTGACACGGATATT CTAGCCCT GGAACGGCGACT GCT GCAAACCAT GGATATGAT 

!^ CATAAGCAAAAAGAAAAGGATGGCAATGGCACGGAGAACAATGT T CCAGAAGGGGGAAGTGCATAACA 

m AAC CAT CAG GT T T C T G G GGAAT GATAAAAAG T GT TAC CAC T T C AG CAT CAGGAAGT GAAAAT C T T AC T 

% CTTAT TCAACAGGAAGTGGAT GCT TTGGAAGAATTAAGCAGGCAGCTT TTTCTGGAAACAGCT GATCT 

CI ATATGCTAC CAAGGAGAGAATAGAATACTCCAAAACCTTCAAGGGGAAATATTT TAATT T TCT TGGT T 

*F3 ACTTTTTCTCTATTTACTGTGTTTGGAAAATTTTCATGGCTACCATCAATATTGTTTTTGATCGAGTT 
f; GGGAAAACGGATCCT GT CACAAGAGGCATTGAGAT CACTGT GAATTAT CT GGGAAT CCAATTT GATGT 

S| GAAGTTTTGGTCCCAACACATTTCCTTCATTCTTGTTGGAATAATCATCGTCACATCCATCAGAGGAT 
m TGCTGATCACTCTTACCAAGTTCTTTTATGCCATCTCTAGCAGTAAGTCCTCCAATGTCATTGTCCTG 
CTATTAGCACAGATAATGGGCATGTACTTTGTCTCCTCTGTGCTGCTGATCCGAATGAGTATGCCTTT 
AGAATACCGCACCATAATCACTGAAGTCCTTGGAGAACTGCAGTTCAACTTCTATCACCGTTGGTTTG 
ATGTGATCTTCCTGGTCAGCGCTCTCTCTAGCATACTCTTCCTCTATTTGGCTCACAAACAGGCACCA 
GAGAAGCAAATGGCACCTTGAACTTAAGCCTACTACAGACTGTTAGAGGCCAGTGGTTTCAAAATTTA 
GATATAAGAGGGGGGAAAAAT GGAACCAGGGC CT GACAT TTTAT AAACAAACAAAAT GCTATGGTAGC 
ATTTT TCAC CT TCATAGCATACTCCTTC CCC GTCAGGTGATACT ATGACCAT GAGT AGCAT CAGCCAG 
AACATGAGAGGGAGAACTAACTCAAGACAATACTCAGCAGAGAGCATCCCGTGTGGATATGAGGCTGG 
TGTAGAGGCGGAGAGGAGCCAAGAAACTAAAGGTGAAAAATACACTGGAACTCTGGGGCAAGACATGT 
CTATGGTAGCT GAGCCAAACACGTAGGAT TT CCGT TT TAAGGTT CACATGGAAAAGGTTATAGCTTT G 
CCTTGAGATTGACTCATTAAAATCAGAGACTGTAACAAAAAAAAAAAAAAAAAAAAAGGGCGGCCGCG 
ACTCTAGAGTCGACCTGCAGAAGCTTGGCCGCCATGGCCCAACTTGTTTATTGCAGCTTATAATG 
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FIGURE 4 

MSFLIDSSIMITSQILFFGFGWLFFMRQLFKDYEIRQYWQVIFSVTFAFSCTMFELIIFEILGV 
LNS S SRYFHWKMNLC VILL I LVFMVP FYI GYFI VSNI RLLHKQRLL FS CLLWLT FMYFFWKLGDP 
FPILSPKHGILSIEQLISRVGVIGVTLMALLSGFGAVNCPYTYMSYFLRNVTDTDILALERRLLQ 
TMDMIISKKKRMANLARRTMFQKGEVHNKPSGFWGMIKSVTTSASGSENLTLIQQEV 
LFLETADLYATKERIEYSKT FKGKYFNFLGYFFS I YCVWKI FMAT INI VFDRVGKTDPVTRGIE I 
TVNYLGIQFDVKFWSQHISFILVGIIIVTSIRGLLITLTKFFYAISSSKSSNVIVLLLAQIMGMY 
FVSSVLL I RMSMPLE YRT 1 1 TEVLGELQFNFYHRWFDVI FLVSALSS I L FL YLAHKQAPEKQMAP 

Important features: 
Signal peptide: 

amino acids 1-23 

Potential transmembrane domains: 

amino acids 37-55, 81-102, 150-168, 288-311, 338-356, 375-398, 
425-444 

N-glycosylation sites. 

amino acids 67-70, 180-183 and 243-246 

Eukaryotic cobalamin-binding proteins 

amino acids 151-160 
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FIGURE s 



AGCAGGGAAATCCGGATGTCTCGGTTATGAAGTGGAGCAGTGAGTGTGAGCCTCAACATAGTTCC 
AGAACT CT CCAT CCGGACTAGT TAT T GAGCATCT GCC TC TCATAT CACCAGTGGCCAT C TGAGG T 
GTTTCCCTGGCTCTGAAGGGGTAGGCACG ATG GCCAGGTGCTTCAGCCTGGTGTTGCTTCTCACT 
TCCATCTGGACCACGAGGCTCCTGGTCCAAGGCTCTTTGCGTGCAGAAGAGCTTTCCATCCAGGT 
GT CATGCAGAAT TATGGGGATCACCC T T GT GAGCAAAAAGGC GAACCAGCAGCTGAAT T TCACAG 
AAGCTAAGGAGGCCTGTAGGCTGCTGGGACTAAGTTTGGCCGGCAAGGACCAAGTTGAAACAGCC 
TTGAAAGCTAGCTTTGAAACTTGCAGCTATGGCTGGGTTGGAGATGGATTCGTGGTCATCTCTAG 
GATTAGCCCAAACCCCAAGTGTGGGAAAAATGGGGTGGGTGTCCTGATTTGGAAGGTTCCAGTGA 
GCCGACAGTTTGCAGCCTATTGTTACAACTCATCTGATACTTGGACTAACTCGTGCATTCCAGAA 
ATT ATCACCACCAAAGATCCCATAT T CAACACTCAAAC TGCAACACAAACAACAGAAT T TAT TG T 
CAGTGACAGTACCTACTCGGTGGCATCCCCTTACTCTACAATACCTGCCCCTACTACTACTCCTC 
CTGCTCCAGCTTCCACTTCTATTCCACGGAGAAAAAAATTGATTTGTGTCACAGAAGTTTTTATG 
GAAACTAGCACCAT GTC TACAGAAACTGAACCAT T T GT TGAAAATAAAGCAGCAT T CAAGAAT GA 
AGCTGCTGGGTTTGGAGGTGTCCCCACGGCTCTGCTAGTGCTTGCTCTCCTCTTCTTTGGTGCTG 
CAGCTGGTCTTGGATTTTGCTATGTCAAAA.GGTATGTGAAGGCCTTCCCTTTTACAAACAAGAAT 
CAGCAGAAGGAAAT GAT CGAAACCAAAGTAGTAAAGGAGGAGAAGGCCAATGATAGCAACCC TAA 
TGAGGAATCAAAGAAAAC T GATAAAAACCCAGAAGAGTCCAAGAGT CCAAGCAAAAC TACCGTGC 
GATGCCTGGAAGCTGAAGTT TAGA TGAGACAGAAATGAGGAGACACACCTGAGGCTGGT TTCT T T 
CATGCTCCTTACCCTGCCCCAGCTGGGGAAATCAAAAGGGCCAAAGAACCAAAGAAGAAAGTCCA 
CCCTTGGTTCCTAACTGGAATCAGCTCAGGACTGCCATTGGACTATGGAGTGCACCAAAGAGAAT 
GCCCTTCTCCTTATTGTAACCCTGTCTGGATCCTATCCTCCTACCTCCAAAGCTTCCCACGGCCT 
TTCTAGCCTGGCTATGTCCTAATAATATCCCACTGGGAGAAAGGAGTTTTGCAAAGTGCAAGGAC 
CTAAAACATCTCATCAGTATCCAGTGGTAAAAAGGCCTCCTGGCTGTCTGAGGCTAGGTGGGTTG 
AAAGCCAAGGAGT CACT GAGACCAAGGC T T T CTC TACT GAT T CCGCAGC T CAGACC CTT TC T T CA 
GCTCTGAAAGAGAAACACGTATCCCACC TGACATGT CCT TCTGAGCCCGGTAAGAGCAAAAGAAT 
GGCAGAAAAG T T TAGCCCC TGAAAGCCATGGAGAT T CT CATAACT TGAGACCTAAT CTCTGTAAA 
GCTAAAATAAAGAAATAGAACAAGGCTGAGGATACGACAGTACACTGTCAGCAGGGACTGTAAAC 
ACAGACAGGGT CAAAGT GT T T TCTCTGAACACAT T GAGT TGGAAT CACTGT TTAGAACACACACA 
CT TACT TTTTCTGGTCTC TACCACT GCT GATATT T T C TC TAGGAAAT ATAC TT T TACAAGTAACA 
AAAATAAA?\ACTCT TATAAATTT CTATT TTTATCTGAGT TACAGAAATGATTACTAAGGAAGAT T 
AC TCAGTAAT TTG T TTAAAAAGT AAT AAAAT T CAACAAACAT T TGCT GAATAGCTAC TATATG T C 
AAGT GCTGTGCAAGGTAT TACACTCTGTAAT TGAATATTATTCCTCAAAA&AT TGCACATAGTAG 
AACGCTATCTGGG7VAGCTATTTTTTTCAGTTTTGATATTTCTAGCTTATCTACTTCCAAACTAAT 
TT T TAT T T T T GCTGAGAC TAATCT TAT T CAT T T TC TCTAATAT GGCAACCAT TAT AACC T TAAT T 
TATTAT TAACATACCTAAGAAGTACATT GTTACCT CTATATACCAAAGCACATTT TAAAAGTGCC 
AT T AACAAAT GT AT CAC TAGCCCT CCT T T TT CCAACAAGAAGGGAC T GAGAGATGCAGAAATAT T 
TGT GACAAAAAATT AAAGCATT TAGAAAACT T 
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FIGURE 6 



MARCFSLVLLLTSIWTTRLLVQGSLRAEELSIQVSCRIMGITLVSKKANQQLNFTEAKEACRLLG 
LSLAGKDQVETALKASFETCSYGWVGDGFWISRISPNPKCGKNGVGVLIWKVPVSRQFAAYCYN 
SSDTWTNSCIPEIITTKDPIFNTQTATQTTEFIVSDSTYSVASPYSTIPAPTTTPPAPASTSIPR 
RKKLICVTEVFMETSTMSTETEPFVENKAAFKNEAAGFGGVPTALLVIALLFFGAAAGLGFCYVK 
RYVKAFPFTNKNQQKEMIETKVVKEEKANDSNPNEESKKTDKNPEESKSPSKTTVRCLEAEV 

Signal sequence: 

&l amino acids 1-16 

f £>: 

y| Transmembrane domain: 

!:ri amino acids 235-254 

. ? 

N-glycosylation site. 

f| amino acids 53-57, 130-134, 289-293 

II! 

Casein kinase II phosphorylation site. 

1*3 

amino acids 145-149, 214-218 

1*7 



Tyrosine kinase phosphorylation site. 

amino acids 79-88 



N-myristoylation site. 

amino acids 23-29, 65-71, 234-240, 235-239, 249-255, 253-259 
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FIGURE 7 



CGCCGCGCTCCCGCACCCGCGGCCCGCCCACCGCGCCGCTCCCGCATCTGCACCCGCAGCCCGGC 
GGCCTCCCGGCGGGAGCGAGCAGATCCAGTCCGGCCCGCAGCGCAACTCGGTCCAGTCGGGGCGG 
CGGCTGCGGGCGCAGAGCGGAGATGCAGCGGCTTGGGGCCACCCTGCTGTGCCTGCTGCTGGCGG 
CGGCGGTCCCCACGGCCCCCGCGCCCGCTCCGACGGCGACCTCGGCTCCAGTCAAGCCCGGCCCG 
GCTCTCAGCTACCCGCAGGAGGAGGCCACCCTCAATGAGATGTTCCGCGAGGTTGAGGAACTGAT 
GGAGGACACGCAGCACAAAT TGCGCAGCGCGGTGGAAGAGATGGAGGCAGAAGAAGCTGCTGC TA 
AAGCATCAT CAGAAGTGAACCTGGCAAACT TACC TCCCAGC TAT CACAATGAGACCAACACAGAC 
ACGAAGGTT GGAAATAATACCAT CCAT GTGCACC GAGAAAT TCACAAGATAACCAACAACCAGAC 
t . T GGACAAAT GGT CT TT TCAGAGACAG T TATCACAT CTGTGGGAGACGAAGAAGGCAGAAGGAGCC 

J!! AC GAGT GCAT CATCGACGAGGAC T GT GGGCC CAGCATGTAC TGCCAGT T TGCCAGC TT C CAGTAC 

|jf ACCTGCCAGCCATGCCGGGGCCAGAGGATGCTCTGCACCCGGGACAGTGAGTGCTGTGGAGACCA 
O GCTGTGTGTCTGGGGTCACTGCACCAAAATGGCCACCAGGGGCAGCAATGGGACCATCTGTGACA 
?P ACCAGAGGGACTGCCAGCCGGGGCTGTGCTGTGCCTTCCAGAGAGGCCTGCTGTTCCCTGTGTGC 
If! ACACCCCTGCCCGTGGAGGGCGAGCTTTGCCATGACCCCGCCAGCCGGCTTCTGGACCTCATCAC 
HI CTGGGAGCTAGAGCCTGATGGAGCCTTGGACCGATGCCCTTGTGCCAGTGGCCTCCTCTGCCAGC 
.ft CCCACAGCCACAGCCTGGTGTATGTGTGCAAGCCGACCTTCGTGGGGAGCCGTGACCAAGATGGG 
iff GAGATCCTGCTGCCCAGAGAGGTCCCCGATGAGTATGAAGTTGGCAGCTTCATGGAGGAGGTGCG 
* W CCAGGAGCTGGAGGACCTGGAGAGGAGCCTGACTGAAGAGATGGCGCTGGGGGAGCCTGCGGCTG 
CCGCCGCTGCACTGCTGGGAGGGGAAGAGATT TAGA TCTGGACCAGGCTGTGGGTAGATGTGCAA 
M TAGAAATAGCTAATTTATTTCCCCAGGTGTGTGCTTTAGGCGTGGGCTGACCAGGCTTCTTCCTA 
ifl CATCTTCTTCCCAGTAAGTTTCCCCTCTGGCTTGACAGCATGAGGTGTTGTGCATTTGTTCAGCT 
Q CCCCCAGGCTGTTCTCCAGGCTTCACAGTCTGGTGCTTGGGAGAGTCAGGCAGGGTTAAACTGCA 
||| GGAGCAGTTTGCCACCCCTGTCCAGATTATTGGCTGCTTTGCCTCTACCAGTTGGCAGACAGCCG 
fj T T T GT TC TACAT GGCT T TGATAAT TGT T TGAGGGGAGGAGATGGAAACAATG TGGAGT C T CCCTC 

f|| TGATTGGTTTTGGGGAAATGTGGAGAAGAGTGCCCTGCTTTGCAAACATCAACCTGGCAAAAATG 
CAACAAATGAATTTTCCACGCAGTTCTTTCCATGGGCATAGGTAAGCTGTGCCTTCAGCTGTTGC 
AGATGAAATGTTCTGTTCACCCTGCATTACATGTGTTTATTCATCCAGCAGTGTTGCTCAGCTCC 
TACCTCTGTGCCAGGGCAGCATTTTCATATCCAAGATCAATTCCCTCTCTCAGCACAGCCTGGGG 
AGGGGGTCATTGTTCTCCTCGTCCATCAGGGATCTCAGAGGCTCAGAGACTGCAAGCTGCTTGCC 
CAAGTCACACAGCTAGTGAAGACCAGAGCAGT TT CATC TGGTTGT GAG T C TAAGCT CAGTGCTCT 
CTCCACTACCCCACACCAGCCTTGGTGCCACCAAAA.GTGCTCCCCAAAAGGAAGGAGAATGGGAT 
TTTTCTTGAGGCATGCACATCTGGAATTAAGGTCAAACTAATTCTCACATCCCTCTAAAAGTAAA 
CTACT G T TAGGAACAGCAGT GT T C T CACAGT GTGGGGCAGC CGTCC TTC TAATGAAGACAATGAT 
ATTGACACTGTCCCTCTTTGGCAGTTGCATTAGTAACTTTGATiAGGTATATGACTGAGCGTAGCA 
TACAGG T TAACC TGCAGAAACAGT AC T TAGG TAAT T G TAGGGC GAGGAT TATAAAT GAAATT TGC 
AAAAT CAC T TAGCAGCAAC T GAAGACAAT TAT CAACCACGT GGAGAAAAT CAAACC GAGCAGGGC 
TGTGTGAAACATGGTTGTAATATGCGACTGCGAACACTGAACTCTACGCCACTCCACAAATGATG 
TTTTCAGGTGTCATGGACTGTTGCCACCATGTATTCATCCAGAGTTCTTAAAGTTTAAAGTTGCA 
CAT GAT T GTATAAGC ATGC TT T CT TT GAGT T TTAAAT TATGTATAAACATAAGT T GCAT T TAGAA 
AT CAAGC ATAAAT CAC T T C AAC T G C AAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 8 



0% 



us 

Hp 



MQRLGATLLCLLLAAAVPTAPAPAPTATSAPVKPGPALSYPQEEATLNEMFREVEELMEDTQHKL 
RSAVEEMEAEEAAAKASSETOIJysrLPPSYHNETNTDTKVGNNTIHVHREIHKITNNQTGQMVFSE 
TVITSVGDEEGRRSHECIIDEDCGPSMYCQFASFQYTCQPCRGQRMLCTRDSECCGDQLCVWGHC 
TKMATRGSNGTICDNQRDCQPGLCCAFQRGLLFPVCTPLPVEGELCHDPASRLLDLITWELEPDG 
ALDRCPCASGLLCQPHSHSLVYVCKPTFVGSRDQDGEILLPREVPDEYEVGSFMEEVRQELEDLE 
RSLTEEMALGEPAAAAAALLGGEEI 

Signal sequence: 

amino acids 1-19 



!|1 N-glycosylation site. 

tfl amino acids 96-100, 106-110, 121-125, 204-208 

III 

P?l Casein kinase II phosphorylation site. 

!|| amino acids 46-50, 67-71, 98-102, 135-139, 206-210, 312-316, 

O 327-331 



N-myristoylation site. 

amino acids 202-208, 217-223 



Amidation site. 

amino acids 140-144 
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FIGURE Q 

CGGACGCGTGGGCGGACGCGTGGGGGCTGTGAGAAAGTGCCAATAAATACATCATGCAACCCCAC 
GGCCCACCTTGTGAACTCCTCGTGCCCAGGGCTGATGTGCGTCTTCCAGGGCTACTCATCCAAAG 
GCCTAATCCAACGTTCTGTCTTCAATCTGCAAATCTATGGGGTCCTGGGGCTCTTCTGGACCCTT 
AACTGGGTACTGGCCCTGGGCCAATGCGTCCTCGCTGGAGCCTTTGCCTCCTTCTACTGGGCCTT 
CCACAAGCCCCAGGACATCCCTACCTTCCCCTTAATCTCTGCCTTCATCCGCACACTCCGTTACC 
ACACTGGGTCATTGGCATTTGGAGCCCTCATCCTGACCCTTGTGCAGATAGCCCGGGTCATCTTG 
GAGTATATTGACCACAAGCTCAGAGGAGTGCAGA^CCCTGTAGCCCGCTGCATCATGTGCTGTTT 
CAAGTGCTGCCTCTGGTGTCTGGAAAAATTTATCAAGTTCCTAAACCGCAATGCATACATCATGA 
TCGCCATCTACGGGAAGAATTTCTGTGTCTCAGCCAAAAATGCGTTCATGCTACTCATGCGAAAC 
ATTGTCAGGGTGGTCGTCCTGGACAAAGTCACAGACCTGCTGCTGTTCTTTGGGAAGCTGCTGGT 
GGTCGGAGGCGTGGGGGTCCT.GTCCTTCTTTTTTTTCTCCGGTCGCATCCCGGGGCTGGGTAAAG 
ACTTTAAGAGCCCCCACCTCAACTATTACTGGCTGCCCATCATGACCTCCATCCTGGGGGCCTAT 
GTCATCGCCAGCGGCTTCTTCAGCGTTTTCGGCATGTGTGTGGACACGCTCTTCCTCTGCTTCCT 
GGAAGACCTGGAGCGGAACAACGGCTCCCTGGACCGGCCCTACTACATGTCCAAGAGCCTTCTAA 
AGATTCTGGGCAAGAAGAACGAGGCGCCCCCGGACAACAAGAAGAGGAAGAAGTGACAGCTCCGG 
CCCTGATCCAGGACTGCACCCCACCCCCACCGTCCAGCCATCCAACCTCACTTCGCCTTACAGGT 
CTCCATTTTGTGGTAAAAAAAGGTTTTAGGCCAGGCGCCGTGGCTCACGCCTGTAATCCAACACT 
T T GAGAGGC T GAGGC GGGCGGAT CACC TGAGT CAGGAGTTCGAGACCAGCC TGGCCAACATGGT G 
AAACCTCCGTCTCTATTAAAAATACAAAAATTAGCCGAGAGTGGTGGCATGCACCTGTCATCCCA 
GC TAC T CGGGAGGC T GAGGCAGGAGAATCGC T T GAACCCGGGAGGCAGAGGT TGCAGTGAGCCGA 
GAT CGCGCCACTGCAC T CCAACC T GGGTGACAGAC TC TG TC T CCAAAACAAAACAAACAAACAAA 
AAGAT T T TAT TAAAGATAT T T T GT TAAC TC 
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FIGURE 10 



RTRGRTRGGCEKV-PINTSCNPTAHLVNSSCPGLMCVFQGYSSKGLIQRSVFNLQIYGVLGLFWTL 
NWVLALGQCVLAGAFAS FYWAFHKPQD I P T FPL I SAFI RT LRYHTGSLAFGAL I LTLVQ I ARVIL 
EYIDHKLRGVQNPVARCIMCCFKCCLWCLEKFIKFLNRNAYIMIAIYGKNFCVSAKNAFMLLMRN 
IVRWVLDKVTDLLLFFGKLLWGGVGVLSFFFFSGRIPGLGKDFKSPHLNYYWLPIMTSILGAY 
VIASGFFSVFGMCVDTLFLCFLEDLERNNGSLDRPYYMSKSLLKILGKKNEAPPDNKKRKK 

Important features: 
Transmembrane domains: 

amino acids 57-80 (type II), 110-126, 215-231, 254-274 

N-glycosylation sites. 

amino acids 16-20, 27-31, 289-293 

Hypothetical YBR002c family proteins. 

amino acids 276-288 

Ammonium transporters proteins. 

amino acids 204-231 

N-myristoylation sites. 

amino acids 60-66, 78-84 

Amidation site. 

amino acids 306-310 
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FIGURE 11 



GCCCCGCGCCCGGCGCCGGGCGCCCGAAGCCGGGAGCCACCGCCATGGGGGCCTGCCTGGGAGCCTGC 
TCCCTGCTCAGCTGCGCGTCCTGCCTCTGCGGCTCTGCCCCCTGCATCCTGTGCAGCTGCTGCCCCGC 
CAGCCGCAACTCCACCGTGAGCCGCCTCATCTTCACGTTCTTCCTCTTCCTGGGGGTGCTGGTGTCCA 
TCATTATGCTGAGCCCGGGCGTGGAGAGTCAGCTCTACAAGCTGCCCTGGGTGTGTGAGGAGGGGGCC 
GGGATCCCCACCGTCCTGCAGGGCCACATCGACTGTGGCTCCCTGCTTGGCTACCGCGCTGTCTACCG 
CATGTGCTTCGCCACGGCGGCCTTCTTCTTCTTCTTTTTCACCCTGCTCATGCTCTGCGTGAGCAGCA 
U GCCGGGACCCCCGGGCTGCCATCCAGAATGGGTTTTGGTTCTTTAAGTTCCTGATGCTGGTGGGCCTC 
CI ACCGTGGGTGCCTTCTACATCCCTGACGGCTCCTTCACCAACATCTGGTTCTACTTCGGCGTCGTGGG 
CTCCTTCCTCTTCATCCTCATCCAGCTGGTGCTGCTCATCGACTTTGCGCACTCCTGGAACCAGCGGT 
fl GGCTGGGCAAGGCCGAGGAGTGCGATTCCCGTGCCTGGTACGCAGGCCTCTTCTTCTTCACTCTCCTC 
J= TTCTACTTGCTGTCGATCGCGGCCGTGGCGCTGATGTTCATGTACTACACTGAGCCCAGCGGCTGCCA 
CGAGGGCAAGGTCTTCATCAGCCTCAACCTCACCTTCTGTGTCTGCGTGTCCATCGCTGCTGTCCTGC 
f|{ CCAAGGTCCAGGACGCCCAGCCCAACTCGGGTCTGCTGCAGGCCTCGGTCATCACCCTCTACACCATG 

? TTTGTCACCTGGTCAGCCCTATCCAGTATCCCTGAACAGAAATGCAACCCCCATTTGCCAACCCAGCT 

*n 

*** GGGCAACGAGACAGTTGTGGCAGGCCCCGAGGGCTATGAGACCCAGTGGTGGGATGCCCCGAGCATTG 

111 

J! TGGGCCTCATCATCTTCCTCCTGTGCACCCTCTTCATCAGTCTGCGCTCCTCAGACCACCGGCAGGTG 
f I AACAGCCT GAT GCAGACCGAGGAGT GCCCACCT AT GCTAGACGCCACACAGCAGCAGCAGCAGCAGGT 

f£ GGCAGCCTGTGAGGGCCGGGCCTTTGACAACGAGCAGGACGGCGTCACCTACAGCTACTCCTTCTTCC 
f|j ACTTCTGCCTGGTGCTGGCCTCACTGCACGTCATGATGACGCTCACCAACTGGTACAAGCCCGGTGAG 
ACCCGGAAGATGATCAGCACGTGGACCGCCGTGTGGGTGAAGATCTGTGCCAGCTGGGCAGGGCTGCT 
CCTCTACCTGTGGACCCTGGTAGCCCCACTCCTCCTGCGCAACCGCGACTTCAGCTGAGGCAGCCTCA 
CAGCCTGCCATCTGGTGCCTCCTGCCACCTGGTGCCTCTCGGCTCGGTGACAGCCAACCTGCCCCCTC 
CCCACACCAATCAGCCAGGCTGAGCCCCCACCCCTGCCCCAGCTCCAGGACCTGCCCCTGAGCCGGGC 
CTTCTAGTCGTAGTGCCTTCAGGGTCCGAGGAGCATCAGGCTCCTGCAGAGCCCCATCCCCCCGCCAC 
ACCCACACGGTGGAGGTGCCTCTTCCTTCCCCTCCTCCCTGTTGCCCATACTCAGCATCTCGGATGAA 
AGGGCTCCCTTGTCCTCAGGCTCCACGGGAGCGGGGCTGCTGGAGAGAGCGGGGAACTCCCACCACAG 
TGGGGCATCCGGCACTGAAGCCCTGGTGTTCCTGGTCACGTCCCCCAGGGGACCCTGCCCCCTTCCTG 
GACTTCGTGCCTTACTGAGTCTCTAAGACTTTTTCTAATAAACAAGCCAGTGCGTGTAAAAAAAA 
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FIGURE 12 



MGACLGACSLLSCASCLCGSAPCILCSCCPASRNSTVSRLIFTFFLFLGVLVSIIMLSPGVESQL 
YKLPWVCEEGAGIPTVLQGHIDCGSLLGYRAVYRMCFATAAFFFFFFTLLMLCVSSSRDPRAAIQ 
NGFWFFKFLILVGLTVGAFYIPDGSFTNIWFYFGWGSFLFILIQLVLLIDFAHSWNQRWLGKAE 
ECDSRAWYAGLFFFTLLFYLLSIAAVALMFMYYTEPSGCHEGKVFISLNLTFCVCVSIAAVLPKV 
QDAQPNSGLLQASVITLYTMFVTWSALSSIPEQKCNPHLPTQLGNETWAGPEGYETQWWDAPSI 
VGLIIFLLCTLFISLRSSDHRQVNSLMQTEECPPMLDATQQQQQQVAACEGRAFDNEQDGVTYSY 
SFFHFCLVIASLHVMMTLTNWYKPGETRKMISTWTAV1WKICASWAGLLLYLWTLVAPLLLRNRD 
FS 

Signal sequence : 

amino acids 1-20 

Transmembrane domains: 

amino acids 40-58, 101-116, 134-150, 162-178, 206-223, 240-257, 
272-283, 324-340, 391-406, 428-444 
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FIGURE 13 

CGGGCCAGCCTGGGGCGGCCGGCCAGGAACCACCCGTTAAGGTGTCTTCTCTTTAGGGATGGTGA 
GGTTGGAAAAAGACTCCTGTAACCCTCCTCCAGGATGAACCACCTGCCAGAAGACATGGAGAACG 
C T CT CACCGGGAGCCAGAGC TCC CAT GCT TCT CT GCGCAATATC CAT TCCAT CAACCCCACACAA 
CT CATGGC CAGGAT T GAGT CCTAT GAAGGAAGGGAAAAGAAAGGCATATCTGATGTCAGGAGGAC 
TTTCTGTTTGTTTGTCACCTTTGACCTCTTATTCGTAACATTACTGTGGATAATAGAGTTAAATG 
TGAATGGAGGCAT TGAGAACACATTAGAGAAGGAGGTGATGCAGTATGACTACTAT TCTTCATAT 
TTTGATATATTTCTTCTGGCAGTTTTTCGATTTAAAGTGTTAATACTTGCATATGCTGTGTGCAG 
ACTGCGCCATTGGTGGGCAATAGCGTTGACAACGGCAGTGACCAGTGCCTTTTTACTAGCAAAAG 
TGATCCTTTCGAAGCTTTTCTCTCAAGGGGCTTTTGGCTATGTGCTGCCCATCATTTCATTCATC 
CTTGCCTGGATTGAGACGTGGTTCCTGGATTTCAAAGTGTTACCTCAAGAAGCAGAAGAAGAAAA 
CAGACTCCTGATAGTTCAGGATGCTTCAGAGAGGGCAGCACTTATACCTGGTGGTCTTTCTGATG 
GTCAGTTTTATTCCCCTCCTGAATCCGAAGCAGGATCTGAAGAAGCTGAAGAAAAACAGGACAGT 
GAGAAACCAC TTTTAGAACTATGAGTACTACT TT TGTTAAATGTGAAAAACCCTCACAGAAAGTC 
AT CGAGGCAAAAAGAGGCAGGCAGT GGAGT C TCCCT GT CGACAGTAAAGT TGAAAT GGTGACGT C 
CACTGCTGGCTTTATTGAACAGCTAATAAAGATTTATTTATTGTAATACCTCACAAACGTTGTAC 
CATATCCATGCACATTTAGTTGCCTGCCTGTGGCTGGTAAGGTAATGTCATGATTCATCCTCTCT 
TCAGTGAGACTGAGCCTGATGTGTTAACAAATAGGTGAAGAAAGTCTTGTGCTGTATTCCTAATC 
AAAAG AC T T AAT AT AT T GAAG T AAC AC T T T T T TAG T AAGCAAG AT AC C T T T T TAT T T C AAT T CAC 
AGAATGGAATTTTTTTGTTTCATGTCTCAGATTTATTTTGTATTTCTTTTTTAACACTCTACATT 
TCCCTTGTTTTTTAACTCATGCACATGTGCTCTTTGTACAGTTTTAAAAAGTGTAATAAAATCTG 
ACATGTCAATGTGGCTAGTTTTATTTTTCTTGTTTTGCATTATGTGTATGGCCTGAAGTGTTGGA 
CT T GCAAAAGGGGAAGAAAGGAAT T GCGAATACAT GTAAAAT GT CACCAGACAT T T GTATTAT T T 
TTATCATGAAATCATGTTTTTCTCTGATTGTTCTGAAATGTTCTAAATACTCTTATTTTGAATGC 
ACAAAATGACT TAAACCAT TCATATCATGTT TCCT TTGCGT TCAGCCAATT TCAAT TAAAATGAA 
CTAAAT TAAAAA 



ID=10063592 



Page 163 



FIGURE 14 



MNHLPEDMENALTGSQSSHASLRNIHSINPTQLMARIESYEGREKKGISDVRRTFCLFVTFDLLF 
VTLLWI I ELNVNGGI ENTLEKEVMQ YDYYS S YFDI FLLAVFRFKVL I IAYAVCRLRHWWAI AL T T 
AVTSAFLLAKVILSKLFSQGAFGWLPIISFILAWIETWFLDFKVLPQEAEEENRLLIVQDASER 
AALIPGGLSDGQFYSPPESEAGSEEAEEKQDSEKPLLEL 

Important features of the protein: 
Signal peptide: 

amino acids 1-20 

Transmembrane domains: 

amino acids 54-72, 100-118, 130-144, 146-166 



f|| N-myristoylation sites. 

* amino acids 14-20, 78-84, 79-85, 202-208, 217-223 
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FIGURE 15 



ACTCGAACGCAGTTGCTTCGGGACCCAGGACCCCCTCGGGCCCGACCCGCCAGGAAAGACTGAGG 
CCGCGGCCTGCCCCGCCCGGCTCCCTGCGCCGCCGCCGCCTCCCGGGACAGAAGATGTGCTCCAG 
GGTCCCTCTGCTGCTGCCGCTGCTCCTGCTACTGGCCCTGGGGCCTGGGGTGCAGGGCTGCCCAT 
CCGGCTGCCAGTGCAGCCAGCCACAGACAGTCTTCTGCACTGCCCGCCAGGGGACCACGGTGCCC 
CGAGACGTGCCACCCGACACGGTGGGGCTGTACGTCTTTGAGAACGGCATCACCATGCTCGACGC 
AGGCAGCTTTGCCGGCCTGCCGGGCCTGCAGCTCCTGGACCTGTCACAGAACCAGATCGCCAGCC 
TGCCCAGCGGGGTCTTCCAGCCACTCGCCAACCTCAGCAACCTGGACCTGACGGCCAACAGGCTG 
CATGAAATCACCAATGAGACCTTCCGTGGCCTGCGGCGCCTCGAGCGCCTCTACCTGGGCAAGAA 
CCGCATCCGCCACATCCAGCCTGGTGCCTTCGACACGCTCGACCGCCTCCTGGAGCTCAAGCTGC 
y* AGGACAACGAGCTGCGGGCACTGCCCCCGCTGCGCCTGCCCCGCCTGCTGCTGCTGGACCTCAGC 
fH CACAACAGCCTCCTGGCCCTGGAGCCCGGCATCCTGGACACTGCCAACGTGGAGGCGCTGCGGCT 
3* GGCTGGTCTGGGGCTGCAGCAGCTGGACGAGGGGCTCTTCAGCCGCTTGCGCAACCTCCACGACC 
Z*l TGGATGTGTCCGACAACCAGCTGGAGCGAGTGCCACCTGTGATCCGAGGCCTCCGGGGCCTGACG 
W 1 CGCCTGCGGCTGGCCGGCAACACCCGCATTGCCCAGCTGCGGCCCGAGGACCTGGCCGGCCTGGC 
III TGCCCTGCAGGAGCTGGATGTGAGCAACCTAAGCCTGCAGGCCCTGCCTGGCGACCTCTCGGGCC 
111 TCTTCCCCCGCCTGCGGCTGCTGGCAGCTGCCCGCAACCCCTTCAACTGCGTGTGCCCCCTGAGC 
TGGTTTGGCCCCTGGGTGCGCGAGAGCCACGTCACACTGGCCAGCCCTGAGGAGACGCGCTGCCA 
CTTCCCGCCCAAGAACGCTGGCCGGCTGCTCCTGGAGCTTGACTACGCCGACTTTGGCTGCCCAG 
ill CCACCACCACCACAGCCACAGTGCCCACCACGAGGCCCGTGGTGCGGGAGCCCACAGCCTTGTCT 
? TCTAGCTTGGCTCCTACCTGGCTTAGCCCCACAGCGCCGGCCACTGAGGCCCCCAGCCCGCCCTC 
|3 CACTGCCCCACCGACTGTAGGGCCTGTCCCCCAGCCCCAGGACTGCCCACCGTCCACCTGCCTCA 
|S ATGGGGGCACATGCCACCTGGGGACACGGCACCACCTGGCGTGCTTGTGCCCCGAAGGCTTCACG 
3f GGCCTGTACTGTGAGAGCCAGATGGGGCAGGGGACACGGCCCAGCCCTACACCAGTCACGCCGAG 
tl GCCACCACGGTCCCTGACCCTGGGCATCGAGCCGGTGAGCCCCACCTCCCTGCGCGTGGGGCTGC 
Iff AGCGCTACCTCCAGGGGAGCTCCGTGCAGCTCAGGAGCCTCCGTCTCACCTATCGCAACCTATCG 
g| GGCCCTGATAAGCGGCTGGTGACGCTGCGACTGCCTGCCTCGCTCGCTGAGTACACGGTCACCCA 
GCTGCGGCCCAACGCCACTTACTCCGTCTGTGTCATGCCTTTGGGGCCCGGGCGGGTGCCGGAGG 



GCGAGGAGGCCTGCGGGGAGGCCCATACACCCCCAGCCGTCCACTCCAACCACGCCCCAGTCACC 
CAGGCCCGCGAGGGCAACCTGCCGCTCCTCATTGCGCCCGCCCTGGCCGCGGTGCTCCTGGCCGC 
GCTGGCTGCGGTGGGGGCAGCCTACTGTGTGCGGCGGGGGCGGGCCATGGCAGCAGCGGCTCAGG 
ACAAAGGGCAGGTGGGGCCAGGGGCTGGGCCCCTGGAACTGGAGGGAGTGAAGGTCCCCTTGGAG 
CCAGGCCCGAAGGCAACAGAGGGCGGTGGAGAGGCCCTGCCCAGCGGGTCTGAGTGTGAGGTGCC 
ACTCATGGGCTTCCCAGGGCCTGGCCTCCAGTCACCCCTCCACGCAAAGCCCTACATCTAAGCCA 
GAGAGAGACAGGGCAGCTGGGGCCGGGCTCTCAGCCAGTGAGATGGCCAGCCCCCTCCTGCTGCC 
ACACCACGTAAGTTCTCAGTCCCAACCTCGGGGATGTGTGCAGACAGGGCTGTGTGACCACAGCT 
GGGCCCTGTTCCCTCTGGACCTCGGTCTCCTCATCTGTGAGATGCTGTGGCCCAGCTGACGAGCC 
CTAACGTCCCCAGAACCGAGTGCCTATGAGGACAGTGTCCGCCCTGCCCTCCGCAACGTGCAGTC 
CCTGGGCACGGCGGGCCCTGCCATGTGCTGGTAACGCATGCCTGGGTCCTGCTGGGCTCTCCCAC 
TCCAGGCGGACCCTGGGGGCCAGTGAAGGAAGCTCCCGGAAAGAGCAGAGGGAGAGCGGGTAGGC 
GGCTGTGTGACTCTAGTCTTGGCCCCAGGAAGCGAAGGAACAAAAGAAACTGGAAAGGAAGATGC 
T T T AGG AAC AT GTTTTGCTTTTT T AAAAT AT AT AT AT T T AT AAG AG AT C C T T T C C CAT T TAT T C T 
GGGAAGATGT TTTT CAAACTCAGAGACAAGGACTT TGGT TT T T GTAAGACAAACGATGATATGAA 
GGCCT T T TG TAAGAAAAAATAAAAGAT GAAGT GT GAAA 
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FIGURE 16 



MCSRVPLLLPLLLLLALGPGVQGCPSGCQCSQPQTVFCTARQGTTVPRDVPPDTVGLYVFENGIT 

MLDAGSFAGLPGLQLLDLSQNQIASLPSGVFQPLANLSNLDLTANRLHEITNETFRGLRRLERLY 

LGKNRIRHIQPGAFDTLDRLLELKLQDNELRALPPLRLPRLLLLDLSHNSLLALEPGILDTANVE 

ALRLAGLGLQQLDEGLFSRLRNLHDLDVSDNQLERVPPVIRGLRGLTRLRLAGNTRIAQLRPEDL 

AGIAALQELDVSNLSLQALPGDLSGLFPRLRLIAAAiUSTPFNCVCPLSWFGPWVRESHVTLASPEE 

TRCHFPPKNAGRLLLELDYADFGCPATTTTATVPTTRPWREPTALSSSLAPTWLSPTAPATEAP 

SPPSTAPPTVGPVPQPQDCPPSTCLNGGTCHLGTRHHLACLCPEGFTGLYCES^IGQGTRPSPTP 

VTPRPPRSLTLGIEPVSPTSLRVGLQRYLQGSSVQLRSLRLTYRNLSGPDKRLVTLRLPASLAEY 

TVTQLRPNATYSVCVMPLGPGRVPEGEEACGEAHTPPAVHSNHAPVTQAREGNLPLLIAPALAAV 

LLAAIAAVGAAYCVRRGRMIAAAAQDKGQVGPGAGPLELEGVKVPLEPGPKATEGGGEALPSG 

CEVPLMGFPGPGLQSPLHAKPYI 

Important features: 
Signal peptide: 

amino acids 1-23 
Transmembrane domain: 

amino acids 579-599 

EGF-like domain cysteine pattern signature. 

amino acids 430-442 
Leucine zipper pattern, 
amino acids 197-219, 269-291 
N-glycosylation sites. 

amino acids 101-105, 117-121, 273-277, 500-504, 528-532 
Tyrosine kinase phosphorylation sites, 
amino acids 124-131, 337-345 
N-myristoylation sites. 

amino acids 23-29, 27-33, 70-76, 142-148, 187-193, 348-354, 
594-600, 640-646 
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FIGURE 17 

GCAGCGGCGAGGCGGCGGTGGTGGCTGAGTCCGTGGTGGCAGAGGCGAAGGCGACAGCTCATCCG 
GGTCCGGATAGGGCTGACGCTGCTGCTGTGTGCGGTGCTGCTGAGCTTGGCCTCGGCGTCCTCGG 
AT GAAGAAGGCAGCCAGGAT GAAT CCT TAGAT TCCAAGACTACT T T GACATCAGAT GAGT CAGTA 
AAGGACCATACTACTGCAGGCAGAGTAGTTGCTGGTCAAATATTTCTTGATTCAGAAGAATCTGA 
ATTAGAATCCTCTATTCAAGAAGAGGAAGACAGCCTCAAGAGCCAAGAGGGGGAAAGTGTCACAG 
AAGATATCAGCT T TCTAGAGTCT CCAAATCCAGAAAACAAGGACTATGAAGAGCCAAAGAAAGTA 
CGGAAACCAGCT T TGAC CGCCAT TGAAGGCACAGCACAT GGGGAGCCCTGCCACTT CCC TT TTCT 
TTTCCTAGATAAGGAGTATGATGAAT GTACAT CAGATGGGAGGGAAGATGGCAGACT GT GGT GT G 
CTACAACCTATGACTACAAAGCAGATGAAAAGTGGGGCTTTTGTGAAACTGAAGAAGAGGCTGCT 
AAGAGACGGGAGAT GCAGGAAGCAGAAATGAT GT ATCAAAC TGGAATGAAAATCCT TAATGGAAG 
CAATAAGAAAAGCCAAAAAAGAGAAGCATATCGG TATCT CCAAAAGGCAGCAAGCAT GAACCATA 
CCAAAGCCCT GGAGAGAGTGTCAT ATGC TCT T TT AT T T GGTGAT TACT TGCCACAGAATATCCAG 
GCAGCGAGAGAGAT GT T T GAGAAGC TGACT GAGGAAGGC TCT CCCAAGGGACAGAC T GC T CT TGG 
CTTTCTGTATGCCTCTGGACTTGGTGTTAATTCAAGTCAGGCAAAGGCTCTTGTATATTATACAT 
T T GGAGC TC T TGGGGGCAATC TAATAGCCCACAT GGT T T TGGTAAGT AGACT T TAG T GGAAGGC T 
AATAATATTAACATCAGAAGAATTTGTGGTTTATAGCGGCCACAACTTTTTCAGCTTTCATGATC 
CAGAT T TGC T TGTAT TAAGACCAAATAT TCAGT T GAAC T TCCT TCAAAT TCT TGT TAAT GGATAT 
AACACAT GGAATCTACATGTAAAT GAAAGT T GGT GGAGTCCACAAT T TT TCT TTAAAAT GAT TAG 
TTTGGCTGATTGCCCCTAAA&AGAGAGATCTGATAAATGGCTCTTTTTAAATTTTCTCTGAGTTG 
GAAT T GTCAGAAT CAT T T TT TACATTAGAT TATCATAAT TT TAAAAAT T T TT CT T TAGT T TT TCA 
AAATTT TGTAAATGGTGGCTATAGAAAAACAACATGAAATAT TATACAATAT TTTGCAACAATGC 
CC TAAGAAT TGT TAAAAT TCAT GGAGT TAT T T GT GCAGAAT GACT C CAGAGAGC T C TAC T T TC T G 
TTTTTTACTTTTCATGATTGGCTGTCTTCCCATTTATTCTGGTCATTTATTGCTAGTGACACTGT 
GCCTGCTTCCAGTAGTCTCATTTTCCCTATTTTGCTAATTTGTTACTTTTTCTTTGCTAATTTGG 
AAGAT T AAC TCAT T T TTAATAAAAT TATGTC TAAGAT TAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 18 



MRVRIGLTLLLCAVLLSLASASSDEEGSQDESLDSKTTLTSDESVKDHTTAGRWAGQIFLDSEESEL 
ESSIQEEEDSLKSQEGESVTEDISFLESPNPENKDYEEPKKVRKPALTAIEGTAHGEPCHFPFLFLDK 
EYDECTSDGREDGRLWCATTYDYKADEKWGFCETEEEAAKRRQMQEAE 

EAYRYLQKAASMNHTKALERVSYALLFGDYLPQNIQAAREMFEKLTEEGSPKGQTALGFLYASGLGVN 
S SQAKALVYYT FGALGGNL I AHMVLVS RL 

Important features: 
Signal peptide: 

amino acids 1-21 

N-glycosylation sites. 

amino acids 195-199, 217-221, 272-276 

Tyrosine kinase phosphorylation site. 

amino acids 220-228 

N-myristoylation sites. 

amino acids 120-126, 253-259, 268-274, 270-274, 285-291, 289-295 

Glycosaminoglycan attachment site. 

amino acids 267-271 

Microbodies C-terminal targeting signal. 

amino acids 299-303 

Type II fibronectin collagen-binding domain protein. 

amino acids 127-169 

Fructose-bisphosphate aldolase class-II protein. 

amino acids 101-119 
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FIGURE 1Q 



AAT T CAGAT T TT AAGCCCAT T CT GCAGT GGAAT T T CATGAACTAGCAAGAGGACAC CATCT TC TT 

G TAT TATACAAGAAAGGAGTGT ACCT AT CACACACAGGGGGAAAAATGC TC T TT T GGGTGCTAGG 

CCTCCTAATCCTCTGTGGTTTTCTGTGGACTCGTAAAGGAAAACTAAAGATTGAAGACATCACTG 

ATAAGTACATT T T TAT CACT GGAT GTGAC TC GGGCTT TGGAAACT T GGCAGCCAGAACT T TT GAT 

AAAAAGGGATTTCATGTAATCGCTGCCTGTCTGACTGAATCAGGATCAACAGCTTTAAAGGCAGA 

AACCTCAGAGAGACTTCGTACTGTGCTTCTGGATGTGACCGACCCAGAGAATGTCAAGAGGACTG 

CCCAGTGGGTGAAGAACCAAGTTGGGGAGAAAGGTCTCTGGGGTCTGATCAATAATGCTGGTGTT 

CCCGGCGTGGTGGCTCCCACTGACTGGCTGACACTAGAGGACTACAGAGAACCTATTGAAGTGAA 

CCTGTTTGGACTCATCAGTGTGACACTAAATATGCTTCCTTTGGTCAAGAAAGCTCAAGGGAGAG 

TTATTAATGTCTCCAGTGTTGGAGGTCGCCTTGCAATCGTTGGAGGGGGCTATACTCCATCCAAA 

TATGCAGTGGAAGGTTTCAATGACAGCTTAAGACGGGACATGAAAGCTTTTGGTGTGCACGTCTC 

AT GCAT TGAACCAGGAT T GT TCAAAACAAAC T TGGCAGATCCAGTAAAGGTAAT TGAAAAAAAAC 

TCGCCATTTGGGAGCAGC TGT CT CCAGACAT CAAACAACAATATGGAGAAGGT T ACAT TGAAAAA 

AGTCTAGACAAACTGAAAGGCAATAAATCCTATGTGAACATGGACCTCTCTCCGGTGGTAGAGTG 

CATGGACCACGCTCTAACAAGTCTCTTCCCTAAGACTCATTATGCCGCTGGAAAAGATGCCAAAA 

TTTTCTGGATACCTCTGTCTCACATGCCAGCAGCTTTGCAAGACTTTTTATTGTTGAAACAGAAA 

GCAGAGCTGGCTAATCCCAAGGCAGTGTGACTCAGCTAACCACAAATGTCTCCTCCAGGCTATGA 

AATTGGCCGATTTCAAGAACACATCTCCTTTTCAACCCCATTCCTTATCTGCTCCAACCTGGACT 

CATTTAGATCGTGCTTATTTGGATTGCAAAAGGGAGTCCCACCATCGCTGGTGGTATCCCAGGGT 

CCCTGCTCAAGTTT TCT T TGAAAAGGAGGGC TGGAATGGTACATCACATAGGCAAGT CCTGCCCT 

GTATTTAGGCTTTGCCTGCTTGGTGTGATGTAAGGGAAATTGAAAGACTTGCCCATTCAAAATGA 

TCTTTACCGTGGCCTGCCCCATGCTTATGGTCCCCAGCATTTACAGTAACTTGTGAATGTTAAGT 

ATCATCTCT TATC TAAATAT T AAAAG AT AAG T CAACCCAAAAAAAAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAAA 



FIGURE 20 



MLFWVLGLLILCGFLWTRKGKLKIEDITDKYIFITGCDSGFGNLAARTFDKKGFHVIAACLTESG 
S TALKAET S ERLRTVLLDVTDPEmTCRTA^ 

REPIEWLFGLISVTLNMLPLVKKAQGRVINVSSVGGRLAIVGGGYTPSKYAVEGFNDSLRRDMK 
AFGVHVSCIEPGLFKTNIADPVKVIEKKIAIWEQLSPDIKQQYGEGYIEKSLDKLKGNKSYVNMD 
LSPVVECMDHALTSLFPKTHYAAGKDAKIFWIPL^ 

Important features of the protein: 
Signal peptide: 

amino acids 1-17 

Transmembrane domain: 

amino acids 136-152 

N-glycosylation sites. 

amino acids 161-163, 187-190 and 253-256 

Glycosaminoglycan attachment site. 

amino acids 39-42 

N-myristoylation sites, 

amino acids 36-41, 42-47, 108-113, 166-171, 198-203 and 207-212 
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FIGURE 21 

CTGAGGCGGCGGTAGCATGGAGGGGGAGAGTACGTCGGCGGTGCTCTCGGGCTTTGTGCTCGGCG 
CACTCGCTTTCCAGCACCTCAACACGGACTCGGACACGGAAGGTTTTCTTCTTGGGGAAGTAAAA 
GGTGAAGCCAAGAACAGCAT TACT GAT TCCCAAAT GGATGATGTTGAAGTTGTTTATACAAT TGA 
CATTCAGAAATATATTCCATGCTATCAGCTTTTTAGCTTTTATAATTCTTCAGGCGAAGTAAATG 
AGCAAGCACT GAAGAAAAT AT TAT CAAATGT CAAAAAGAAT GTGGT AGG TTGGTACAAAT TC CGT 
CGTCATTCAGATCAGATCATGACGTTTAGAGAGAGGCTGCTTCACAAAAACTTGCAGGAGCATTT 
TTCAAACCAAGACCTTGTTTTTCTGCTATTAACACCAAGTATAATAACAGAAAGCTGCTCTAC TC 
f*J ATCGACTGGAACATTCCTTATATAAACCTCAAAAAGGACTTTTTCACAGGGTACCTTTAGTGGTT 
C| GCCAATCTGGGCATGTCTGAACAACTGGGTTATAAAACTGTATCAGGTTCCTGTATGTCCACTGG 
T T T T AGC CGAGCAG T AC AAAC ACAC AGC T C TAAAT T T T T T GAAGAAGAT GGAT C C T T AAAGGAG G 
TACATAAGATAAATGAAAT GTAT GC T T CAT TACAAGAGGAAT TAAAGAGT ATAT GCAAAAAAGT G 
GAA.GACAGTGAACAAGCAGTAGATAAAC TAGT AAAGGATGTAAACAGAT TAAAACGAGAAAT TGA 
GAAAA.GGAGAGGAGCACAGAT TCAGGCAGCAAGAGAGAAGAACAT CCAAAAAGACCC TCAGGAGA 
ACATT T T TC T TT GT CAGGCAT T ACGGACC T t T T T T CCAAAT TC TGAAT T TCT TCAT T CATGTG T T 
W'l ATGTCTTTAAAAftATAGACATGTTTCTAAAAGTAGCTGTAACTACAACCACCATCTCGATGTAGT 
Jjl AGACAAT C TGACC T TAATGGTAGAACACAC T GACAT TCCTGAAGCT AGT CCAGC TAGT ACACCAC 

?H AAAT CAT TAAGCATAAAGCCT T AGACT TAGAT GACAGAT GGCAAT T CAAGAGAT CT CGGT TGT TA 

g| GATACACAAGACAAACGATCTAAAGCAAATACTGGTAGTAGTAACCAAGATAAAGCATCCAAAAT 
HI GAGCAGCCCAGAAACAGATGAAGAAAT T GAAAAGAT GAAGGGT T T T GGTGAATAT T CACGGTC T C 

CTACATT T TGA TCC TT T TAACCT TACAAGGAGAT T T T T T TAT T T GGC T GAT GGGTAAAGCC AAAC 
AT T T C TATT GT T T T TAC TAT GT T GAGC TAC T T GCAGTAAGT T CAT T TGT TT T TACT AT GT TCACC 
TGTTTGCAGTAATACACAGATAACTCTTAGTGCAT TTACTTCACAAAGTACTTT TTCAAACAT CA 
GAT GCTT TTAT TTCCAAACC TT TT TTTCACC TTTCACTAAGTT GTTGAGGGGAAGGCTTACACAG 
ACACAT TCTT TAGAATTGGAAAAGTGAGACCAGGCACAGTGGCTCACACC TGTAATCCCAGCACT 
TAGGGAAGACAAG T CAGGAGGAT T GAT TGAAGCTAGGAGT TAGAGACCAGCCTGGGCAACG TAT T 
GAGACCATGT CTAT TAAAAAATAAAAT GGAAAAGCAAGAATAGCCT TAT T T T CAAAATATGGAAA 
GAAATT TATAT GAAAAT T TAT CTGAGTCAT TAAAAT TC TCC T TAAGT GATACT T T T T T AGAAG T A 
CAT TATGGCTAGAGTTGCCAGATAAAATGCTGGATATCATGCAATAAAT TTGCAAAACATCATCT 
AAAATTTAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 22 



MEGESTSAVLSGFVLGALAFQHLNTDSDTEGFLLGEVKGEAKNS I TDSQMDDVEWYT I DIQKYI 
PCYQLFSFYNSSGEVNEQALKKILSNVKKNWGWYKFRRHSDQIMTFRERLLHKNLQEHFSNQDL 
VFLLLTPSIITESCSTHRLEHSLYKPQKGLFHRVPLWANLGMSEQLGYKTVSGSCMSTGFSRAV 
QTHSSKFFEEDGSLKEVHKINEMYASLQEELKSICKKVEDSEQAVDKLVKDVNRLKREIEKRRGA 
QIQAAREKNIQKDPQENIFLCQALRTFFPNSEFLHSCVMSLKNRHVSKSSCNYNHHLDWDNLTL 
MVEHTDIPEASPASTPQIIKHKALDLDDRWQFKRSRLLDTQDKRSKANTGSSNQDKASKMSSPET 
DEEIEKMKGFGEYSRSPTF 

Important features: 
Signal peptide: 

amino acids 1-19 

N-glycosylation sites. 

amino acids 75-79, 322-326 

N-myristoylation site. 

amino acids 184-154 

Growth factor and cytokines receptors family. 

amino acids 134-150 
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FIGURE 23 



GGCACAGCCGCGCGGCGGAGGGCAGAGTCAGCCGAGCCGAGTCC^GCCGGACGAGCGGACCAGCGC^GGGCAGCCC^A 
GCAGCGCGCAGCGAACGCCCGCCGCCGCCCACACCCTCTGCGGTGCCCGCGGCGCCTGCCACCCTTCCCTCCTTCCCC 
GCGTCCCCGCCTCGCCGGCCAGTCAGCTTGCCGGGTTCGCTGCCCCGCGAAACCCCGAGGTCACCAGCCCGCGCCTCT 
GCTTCCCTGGGCCGCGCGCCGCCTCCACGCCCTCCTTCTCCCCTGGCCCGGCGCCTGGCACCGGGGACCGTTGCCTGA 
CGCGAGGCCCAGCTCTACTTTTCGCCCCGCGTCTCCTCCGCCTGCTCGCCTCTTCCACCAACTCCAACTCCTTCTCCC 
TCCAGCTCCACTCGCTAGTCCCCGACTCCGCCAGCCCTCGGCCCGCTGCCGTAGCGCCGCTTCCCGTCCGGTCCCAAA 
GGTGGGAACGCGTCCGCCCCGGCCCGCACCATGGCACGGTTCGGCTTGCCCGCGCTTCTCTGCACCCTGGCAGTGCTC 
AGCGCCGCG C T GCT GGCT GCC GAG CT CAAGT C GAAAAGTT GCTC GGAAGT GC GACGT CTTTAC GT GT CCAAAGGCT T C 
r * S AACAAGAACGATGCCCCCCT CCACGAGATCAACGGTGATCATTTGAAGATCT GT CCCCAGGGTTCTACCTGCTGCTCT 

j* ? CAAGAGATGGAGGAGAAGT ACAGCCT G CAAAGTAAAGAT GATTT CAAAAGTGTGGTCAG CGAACAGT G CAATCATTT G 

CAAGCTGTCTTT GCTT CAC GTTACAAGAAGT T TGATGAATT CTT C AAAGAACTACT T GAAAAT GCAGAGAAATCC CT G 
AATGAT ATGT TT GT GAAGACATAT GGCCATT TATACATGCAAAATT CT GAGCTATTT AAAGAT CT CTT CGT AGAGT T G 
AAACGTTACTACGT G GT GGGAAAT GT GAAC CT GGAAGAAAT GCT AAAT GACT T CT GGGCT CGCCTCCTGGAGC GGAT G 
T T CC GC CTGGT GAAC T CCCAGTACCACTTTACAGAT GAGTAT CT GGAAT GTGTGAGCAAGTATACGGAGCAGCT GAAG 
CCCTTCGGAGATGTCCCTCGCAAATTGAAGCTCCAGGTTACTCGTGCTTTTGTAGCAGCCCGTACTTTCGCTCAAGGC 
TTAGCGGTTGCGGGAGATGTCGTGAGCAAGGTCTCCGTGGTAAACCCCACAGCCCAGTGTACCCATGCCCTGTTGAAG 
AT GAT CT ACT GCTCCCACT G CCG GGGT CT C GT GACT GT GAAGC CATGT T ACAACT ACT GCTCAAACAT CATGAGAGGC 
TGTTTGGCCAACCAAGGGGATCTCGATTTTGAATGGAACAATTTCATAGATGCTATGCTGATGGTGGCAGAGAGGCTA 
GAGGGTC CT TT CAACAT T GAAT C GGTCAT GGAT C CCATCGAT GT GAAGAT TTCT GAT GCT AT TAT GAACATGCAG GAT 
AATAGT GT T CAAGT GT CT CAGAAGGTT TT C CAGGGAT GT GGAC C C CCCAAGC CC CTCC CAGCT GGAC GAATTT CT CGT 
T C CAT CT CT GAAAGT GCCT T CAGT G CT CG CTT CAGAC CACATCAC CC CGAGGAACGCC CAACCACAGCAGCT G GCACT 
AGTTTGGACCGACTGGTTACTGATGTCAAGGAGAAACTGAAACAGGCCAAGAAATTCTGGTCCTCCCTTCCGAGCAAC 
GTT TGCAACGAT GAGAG GAT GGCT GCAGGAAACGGCAATGAGGAT GACT GTT GGAAT GGGAAAG GCAAAAGCAGGTAC 
CT GTTTGCAGT GACAGGAAATGGAT TAGCCAAC CAGGGCAACAAC CCAGAGGT C CAGGTT GACACCAGCAAACCAGAC 
ATACTGAT C CTT C GT CAAAT CAT GGCT CTT CGAGT GAT GACCAGCAAGAT GAAGAAT GCATACAAT GGGAACGAC GT G 
GACTT CTT T GAT AT CAGT GAT GAAAGTAGTGGAGAAGGAAGTGGAAGT GGCT GT GAGTAT CAGCAGT GCCCTTCAGAG 
TTT GACTACAAT GC CACT GACCAT GCT GGGAAGAGTGC CAAT GAGAAAGCCGACAGTGCT GGT GTC C GT CCTGGGGCA 
CAGGCCTAC CT C CT CACT GT CT T CT GCAT CTT GT T CCT GGTTAT GCAGAGAGAGT GGAGATAATT CT CAAACT CT GAG 
AAAAAGT GTT CAT CAAAAAGTTAAAAGGCACCAGTTAT CACTT TT CT ACCAT CCTAGT GACT TT GCT TTTTAAAT GAA 
TGGACAACAATGTACAGTTTTTACTATGTGGCCACTGGTTTAAGAAGTGCTGACTTTGTTTTCTCATTCAGTTTTGGG 
AGGAAAAGGGACT GT GCAT T GAGT T GGTTCCT GCT CCCCCAAACCAT GT TAAAC GT GGCTAACAGT GTAGGTACAGAA 
CTATAGTTAGTTGTGCATTTGTGATTTTATCACTCTATTATTTGTTTGTATGTTTTTTTCTCATTTCGTTTGTGGGTT 
TTTTTTTCCAACTGTGATCTCGCCTTGTTTCTTACAAGCAAACCAGGGTCCCTTCTTGGCACGTAACATGTACGTATT 
T CT GAAATAT TAAAT AG CT GTAC AGAAGCAG GTT TTATTTAT CAT GTTAT CT T AT T AAAAGAAAAAGC CCAAAAAGC 
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FIGURE 24 



MARFGLPALLCTLAVLSAALLAAELKSKSCSEWRLYVSKGFNKNDAPLHEINGDHLKICPQGST 
CCSQEMEEKYSLQSKDDFKSWSEQCNHLQAVFASRYKKFDEFFKELLENAEKSLNDMFVKTYGH 
LYMQNSELFKDLFVELKRYYWGNWLEEMLNDFWARLLERMFRLVNSQYHFTDEYLECVSKYTE 
QLKPFGDVPRKLKLQVTRAFVAARTFAQGLAVAGDWSKYSVVNPTAQCTHALLKMIYCSHCRGL 
VTVKPCYNYCSNIMRGCLANQGDLDFEWNNFIDAMIK^AERLEGPFNIESVMDPIDWISDAIM^ 
MQDNSVQVSQKVFQGCGPPKPLPAGRISRSISESAFSARFRPHHPEERPTTAAGTSLDRLVTDVK 
EKLKQAKKFWSSLPSNVCNDERMAAGNGNEDDCWNGKGKSRYLFAVTGNGLANQGNNPEVQVDTS 
KPDILILRQIMALRVMTSKMKNAYNGNDVDFFDISDESSGEGSGSGCEYQQCPSEFDYNATDHAG 
KSANEKADSAGVRPGAQAYLLTVFC I L FLVMQREWR 

Important features: 
Signal peptide: 

amino acids 1-22 

ATP/GTP-binding site motif A (P-loop) . 

amino acids 515-524 

N-glycosylation site. 

amino acids 514-518 

Glycosaminoglycan attachment sites. 

amino acids 494-498, 498-502 

N-myristoylation sites. 

amino acids 63-69, 224-230, 276-282, 438-444, 497-503, 531-537 
Glypicans proteins. 

amino acids 54-75, 105-157, 238-280, 309-346, 423-460, 468-506 
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FIGURE as 



CTCGCCCTCAAATGGGAACGCTGGCCTGGGACTAAAGCATAGACCACCAGGCTGAGTATCCTGAC 
C T GAGT CAT CCCCAGGGATCAGGAGCCTCCAGCAGGGAACC T TCCAT TATATTCT T CAAGCAACT 
TACAGCTGCACCGACAGTTGCGATGAAAGTTCTAATCTCTTCCCTCCTCCTGTTGCTGCCACTAA 
TGCTGATGTCCATGGTCTCTAGCAGCCTGAATCCAGGGGTCGCCAGAGGCCACAGGGACCGAGGC 
CAGGCTTCTAGGAGATGGCTCCAGGAAGGCGGCCAAGAATGTGAGTGCAAAGATTGGTTCCTGAG 
AGCCCCGAGAAGAAAATTCATGACAGTGTCTGGGCTGCCAAAGAAGCAGTGCCCCTGTGATCATT 
TCAAGGGCAATGTGAAGAAAACAAGACACCAAAGGCACCACAGAAAGCCAAACAAGCATTCCAGA 
GCCTGCCAGCAATTTCTCAAACAATGTCAGCTAAGAAGCTTTGCTCTGCCTTTGTAGGAGCTCTG 
AGCGCCCACTCTTCCAATTAAACATTCTCAGCCAAGAAGACAGTGAGCACACCTACCAGACACTC 
TTCTTCTCCCACCTCACTCTCCCACTGTACCCACCCCTAAATCATTCCAGTGCTCTCAAAAAGCA 
TGTTTTTCAAGATCATTTTGTTTGTTGCTCTCTCTAGTGTCTTCTTCTCTCGTCAGTCTTAGCCT 
GTGCCCTCCCCTTACCCAGGCTTAGGCTTAATTACCTGAAAGATTCCAGGAAACTGTAGCTTCCT 
AGC T AG T G T CAT T T AAC C T T AAAT GCAAT C AGGAAAGTAGC AAAC AGAAGT C AAT AAAT AT T T T T 
AAATGTCAAAAAAAAAAAAAAAAAA 
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FIGURE 26 

MKVLISSLLLLLPLMLMSMVSSSLNPGVARGHRDRGQASRRWLQEGGQECECKDWFLRAPRRKEM 
TVSGLPKKQCPCDHFKGNVKKTRHQRHHRKPNKHSRACQQFLKQCQLRSFALPL 



Important features: 
Signal peptide: 

amino acids 1-22 



N-myristoylation sites. 

amino acids 27-33, 46-52 
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FIGURE 27 

GGACGCCAGCGCCTGCAGAGGCTGAGCAGGGAAAAAGCCAGTGCCCCAGCGGAAGCACAGCTCAG 
AGCTGGTCTGCCATGGACATCCTGGTCCCACTCCTGCAGCTGCTGGTGCTGCTTCTTACCCTGCC 
CCTGCACCTGATGGCTCTGCTGGGCTGCTGGCAGCCCCTGTGCAAAAGCTACTTCCCCTACCTGA 
TGGCCGTGCTGACTCCCAAGAGCAACCGCAAGAT GGAGAGCAAGAAACGGGAGCT CTTCAGCCAG 
ATAAAGGGGCTTACAGGAGCCTCCGGGAAAGTGGCCCTACTGGAGCTGGGCTGCGGAACCGGAGC 
CAACTTTCAGTTCTACCCACCGGGCTGCAGGGTCACCTGCCTAGACCCAAATCCCCACTTTGAGA 
AGTTCCTGACAAAGAGCATGGCTGAGAACAGGCACCTCCAATATGAGCGGTTTGTGGTGGCTCCT 
GGAGAGGACATGAGACAGCTGGCTGATGGCTCCATGGATGTGGTGGTCTGCACTCTGGTGCTGTG 
CTCTGTGCAGAGCCCAAGGAAGGTCCTGCAGGAGGTCCGGAGAG.TACTGAGACCGGGAGGTGTGC 
TCTTTTTCTGGGAGCATGTGGCAGAACCATATGGAAGCTGGGCCTTCATGTGGCAGCAAGTTTTC 
GAGCCCACCTGGAAACACATTGGGGATGGCTGCTGCCTCACCAGAGAGACCTGGAAGGATCTTGA 
GAACGCCCAGTTCTCCGAAATCCAAATGGAACGACAGCCCCCTCCCTTGAAGTGGCTACCTGTTG 
GGCCCCACATCATGGGAAAGGCTGTCAAACAATCTTTCCCAAGCTCCAAGGCACTCATTTGCTCC 
T TCCCCAGCCTCCAAT TAGAACAAGCCACCCACCAGCC TAT C TAT CT TCCAC T GAGAGGGACC TA 
GCAGAATGAGAGAAGACATTCATGTACCACCTACTAGTCCCTCTCTCCCCAACCTCTGCCAGGGC 
AATCTCTAACTTCAATCCCGCCTTCGACAGTGAAAAAGCTCTACTTCTACGCTGACCCAGGGAGG 
AAACACTAGGACCCTGTTGTATCCTCAACTGCAAGTTTCTGGACTAGTCTCCCAACGTTTGCCTC 
CCAATGTTGTCCCTTTCCTTCGTTCCCATGGTAAAGCTCCTCTCGCTTTCCTCCTGAGGCTACAC 
CCATGCGTCTCTAGGAACTGGTCACAAAAGTCATGGTGCCTGCATCCCTGCCAAGCCCCCCTGAC 
CCTCTCTCCCCACTACCACCTTCTTCCTGAGCTGGGGGCACCAGGGAGAATCAGAGATGCTGGGG 
AT GCCAGAGCAAGACTCAAAGAGGCAGAGGT T T TG T TCTCAAATAT T TT T TAATAAATAGACGAA 
ACCACG 



FIGURE 28 



MDILVPLLQLLVLLLTLPLHIMALLGCWQPLCKSYFPYLMAVLTPKSNRKMESKKRELFSQIKGL 
TGASGKVALLELGCGTGANFQFYPPGCRVTCLDPNPHFEKFLTKSMAENRHLQYERFWAPGEDM 

rqladgsmdvwctlvlcsvqsprkvm 

kh i gdgccl tret wkdlenaqfse i qmerqp pplkwlpvgph imgkavkqs fps skal i cs fp sl 

QLEQATHQPIYLPLRGT 

Important features: 
Signal peptide: 

amino acids 1-23 

Leucine zipper pattern. 

amino acids 10-32 

N-myristoylation sites. 

amino acids 64-70, 78-84, 80-86, 91-97, 201-207 
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FIGURE 2Q 

CAATGTTTGCCTATCCACCTCCCCCAAGCCCCTTTACCT ATG CTGCTGCTAACGCTGCTGCTGCT 
GCTGCTGCTGCTTAAAGGCTCATGCTTGGAGTGGGGACTGGTCGGTGCCCAGAAAGTCTCTTCTG 
CCACTGACGCCCCCATCAGGGATTGGGCCTTCTTTCCCCCTTCCTTTCTGTGTCTCCTGCCTCAT 
CGGCCTGCCATGACCTGCAGCCAAGCCCAGCCCCGTGGGGAAGGGGAGAAAGTGGGGGATGGCTA 
AGAAAGC TGGGAGAT AGGGAACAGAAGAGGGTAGT GGGT GGGCT AGGGGGGC T GCC TT AT T TAAA 
GT GGT T G TT TAT GAT TC T TATACTAAT T TATACAAAGATAT TAAGGCCCT GT T CAT TAAGAAAT T 
GTTCCCTTCCCCTGTGTTCAATGTTTGTAAAGATTGTTCTGTGTAAATATGTCTTTATAATAAAC 
AGT TAAAAGC T GAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 30 

MLLLTLLLLLLLLKGSCLEWGLVGAQKVSSATDAPIRDWAFFPPSFLCLLPHRPAMTCSQAQPRG 
EGEKVGDG 
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Important features: 
Signal peptide: 

amino acids 1-15 

Growth factor and cytokines receptors family: 

amino acids 3-18 



APPJD-10063592 Page 180 of 318 



FIGURE 31 

GTTTGAATTCCTTCA?ICTATACCCACAGTCCAAAAGCAGACTCACTGTGTCCCAGGCTACCAGTT 
CCTCCAAGCAAGTCATTTCCCTTATTTAACCGATGTGTCCCTCAAACACCTGAGTGCTACTCCCT 
AT T TGCATC TGT T T TGATAAAT GAT GT TGACACCC TCCACC GAAT T CTAAGTGGAAT CATGTCGG 
GAAGAGATACAATCCTTGGCCTGTGTATCCTCGCATTAGCCTTGTCTTTGGCCATGATGTTTACC 
TTCAGATTCATCACCACCCTTCTGGTTCACATTTTCATTTCATTGGTTATTTTGGGATTGTTGTT 
TG T CT GCGGT G T T T TATGGT GGC T GTAT TATGACTATACCAACGAC CTCAGCAT AGAAT T GGACA 
CAGAAAGGGAAAAT AT GAAGT GCGT GC T GGGGTT TGCTATCGTAT C CACAGGCAT CACGGCAGT G 
CTGCTCGTCTT GAT TT T TGT TCT CAGAAAGAGAATAAAAT TGACAGTT GAGCT TT TGCAAATCAC 
AAATAAAGCCATCAGCAGTGCTCCCTTCCTGCTGTTCCAGCCACTGTGGACATTTGCCATCCTCA 
TTTTCTTCTGGGTCCTCTGGGTGGCTGTGCTGCTGAGCCTGGGAACTGCAGGAGCTGCCCAGGTT 
ATGGAAGGCGGCCAAGTGGAATATAAGCCCCTTTCGGGCATTCGGTACATGTGGTCGTACCATTT 
AATTGGCCTCATCTGGACTAGTGAATTCATCCTTGCGTGCCAGCAAATGACTATAGCTGGGGCAG 
TGGTTACTTGTTATTTCAACAGAAGTAAAAA.TGATCCTCCTGATCATCCCATCCTTTCGTCTCTC 
TCCATTCTCTTCTTCTACCATCAAGGAACCGTTGTGAAAGGGTCATTTTTAATCTCTGTGGTGAG 
GAT T CCGAGAATCAT TGT CATGT ACAT GCAAAACGCAC TGAAAGAACAGCAGCATGGT GCAT TGT 
CCAGGTACCTGTTCCGATGCTGCTACTGCTGTTTCTGGTGTCTTGACAAATACCTGCTCCATCTC 
AACCAGAATGCATATACTACAACTGCTATTAATGGGACAGAT TTCTGTACATCAGCAAAAGATGC 
AT T CAAAAT C T TGT CCAAGAACT CAAGT CAC T T T ACAT C TAT TAACT GC T T T GGAGAC T TCATAA 
TTTTTCTAGGAAAGGTGTTAGTGGTGTGTTTCACTGTTTTTGGAGGACTCATGGCTTTTAACTAC 
AATCGGGCATTCCAGGTGTGGGCAGTCCCTCTGTTATTGGTAGCTTTTTTTGCCTACTTAGTAGC 
CCATAGTTTTTTATCTGTGTTTGAAACTGTGCTGGATGCACTTTTCCTGTGTTTTGCTGTTGATC 
T GGAAACAAAT GAT GGATCGT CAGAAAAGCC CTACT T TATGGATCAAGAAT T T CTGAGTT TCGT A 
AAAAGGAGCAACAAATTAAACAATGCAAGGGCACAGCAGGACAAGCACTCATTAAGGAATGAGGA 
GGGAACAGAACTCCAGGCCATTGTGAG ATAGA TACCCAT TTAGGTAT CT GTACCTGGAAAACATT 
T C C T T C T AAG AGC CAT T T AC AG AAT AGAAGAT G AG ACC AC T AG AG AAAAG T TAG T GAAT T T T T T T 
TTAAAAGACCTAATAAACCCTATTCTTCCTCAAAA 
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FIGURE 22 

MSGRDT ILGLCI LALALSLMMFT FRFI TTLLVH I FI S LVI LGLLFVCGVLWWLY YDYTNDLS IE 
LDTERENMKC VLGFAI VS TG I TAVLLVL I FVLRKRI KLT VEL FQ I TNKAI S SAP FLL FQPLWT FA 
ILIFFWVLWAVLLSLGTAGAAQVMEGGQVEYKPLSGIRYMWSYHLIGLIWTSEFILACQQMTIA 
GAWTC YFNRSKNDPPDHP I LSSLS ILFFYHQGTWKGS FL I SWRI PRI I VMYMQNALKEQQHG 
ALSRYLFRCCYCCFWCLDKYLLHLNQNAYTTTAINGTDFCTSAKDAFKILSKNSSHFTSINCFGD 
FI I FLGKVLWCFT VFGGLMAFNYNRAFQVWAVPLLLVAFFAYLVAHS FL S VFE TVLDAL FLC FA 
VDLETNDGSSEKPYFMDQEFLSFVKRSNKLNNARAQQDKHSLRNEEGTELQAIVR 

Important features: 
Signal peptide: 

amino acids 1-20 

Putative transmembrane domains: 

amino acids 35-54, 75-97, 126-146, 185-204, 333-350, 352-371 

N-glycosylation sites. 

amino acids 204-208, 295-299, 313-317 

N-myristoylation sites. 

amino acids 147-153, 178-184, 196-202, 296-275, 342-348 
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FIGURE 33 



GT T CGAT TAGCT CCTC TGAGAAGAAGAGAAAAGGT T CT TGGACCT CT CCCTGT T TC T TCC T TAGA 

ATAATTTGTATGGGATTTGTGATGCAGGAAAGCCTAAGGGAAAAAGAATATTCATTCTGTGTGGT 

GAAAAT T T T T TGAAAAAAAAAT T GC CT TC T TCAAACAAGGGTGTCAT TC T GAT AT T T ATGA GGAC 

TGTTGTTCTCACTATGAAGGCATCTGTTATTGAAATGTTCCTTGTTTTGCTGGTGACTGGAGTAC 

ATTCAAACAAAGAAACGGCAAAGAAGATTAAAAGGCCCAAGTTCACTGTGCCTCAGATCAACTGC 

GAT GTCAAAGCCGGAAAGATCAT CGATCCTGAGT TCAT TGTGAAATGTCCAGCAGGATGCCAAGA 

CCCCAAATACCATGTTTATGGCACTGACGTGTATGCATCCTACTCCAGTGTGTGTGGCGCTGCCG 

TACACAGTGGTGTGCTTGATAATTCAGGAGGGAAAATACTTGTTCGGAAGGTTGCTGGACAGTCT 

GGTTACAAAGGGAGTTATTCCAACGGTGTCCAATCGTTATCCCTACCACGATGGAGAGAATCCTT 

T ATCGT C TTAGAAAGTAAACCCAAAAAGGGTGTAACCTACCCATCAGC T C T TACATACT CATCAT 

CGAAAAGTCCAGC TGCCCAAGCAGGTGAGACCACAAAAGCC TAT CAGAGGCCACCTATT CCAGGG 

ACAACTGCACAGCCGGTCACTCTGATGCAGCTTCTGGCTGTCACTGTAGCTGTGGCCACCCCCAC 

CACCTTGCCAAGGCCATCCCCTTCTGCTGCTTCTACCACCAGCATCCCCAGACCACAATCAGTGG 

GCCACAGGAGCCAGGAGATGGAT C T CTGGTCCAC T GCCACC TACACAAGCAGCCAAAACAGGCCC 

AGAGCTGATCCAGG TAT CCAAAGGCAAGAT CC TT CAGGAGC T GCC T TCCAGAAACC T GT TGGAGC 

GGATGTCAGCCTGGGACTTGTTCCAAAAGAAGAAT TGAGCACACAGTC TT TGGAGCCAGTATCCC 

TGGGAGATCCAAACTGCAAAAT TGACT T GTCGT T T T TAATTGAT GGGAGCACCAGCAT T GGCAAA 

CGGCGATTCCGAATCCAGAAGCAGCTCCTGGCTGATGTTGCCCAAGCTCTTGACATTGGCCCTGC 

CGGTCCACTGATGGGTGTTGTCCAGTATGGAGACAACCCTGCTACTCACTTTAACCTCAAGACAC 

ACACGAAT TC TCGAGATC TGAAGACAGC CATAGAGAAAATTAC TCAGAGAGGAGGAC T T T C TAAT 

GTAGGTCGGGCCATCTCCTTTGTGACCAAGAACTTCTTTTCCAAAGCCAATGGAAACAGAAGCGG 

GGCTCCCAATGTGGTGGTGGTGATGGTGGATGGCTGGCCCACGGACAAAGTGGAGGAGGCTTCAA 

GACTTGCGAGAGAGTCAGGAATCAACATTTTCTTCATCACCATTGAAGGTGCTGCTGAAAATGAG 

AAGCAGTATGTGGTGGAGCCCAACTTTGCAAACAAGGCCGTGTGCAGAACAAACGGCTTCTACTC 

GCTCCACGTGCAGAGCTGGTTTGGCCTCCACAAGACCCTGCAGCCTCTGGTGAAGCGGGTCTGCG 

ACACTGACCGCCTGGCCTGCAGCAAGACCTGCTTGAACTCGGCTGACATTGGCTTCGTCATCGAC 

GGCTCCAGCAGTGTGGGGACGGGCAACTTCCGCACCGTCCTCCAGTTTGTGACCAACCTCACCAA 

AGAGT T T GAGATT T CCGACACGGACACGCGCAT CGGGGC CGT GCAGTACACC TACGAACAGCGGC 

TGGAGTT TGGGTTCGACAAGTACAGCAGCAAGCCTGACATCC T CAACGCCATCAAGAGGGTGGGC 

TACTGGAGTGGTGGCACCAGCACGGGGGCTGCCATCAACTTCGCCCTGGAGCAGCTCTTCAAGAA 

GT CCAAGCCCAACAAGAGGAAGT TAAT GATCCTCATCAC CGACGGGAGGTCC TACGACGACGT CC 

GGAT CCCAGCC AT GGC T GCCCAT C T GAAGGGAGT GATCACCTATGCGATAGGCGT T GCC TGGGC T 

GC CCAAGAGGAGC TAGAAGTCAT TGCCAC TCACC CC GCCAGAGACCAC TCCT T CTT T GT GGACGA 

GTTTGACAACCTCCATCAGTATGTCCCCAGGATCATCCAGAACATT T GTACAGAGT TCAACTCAC 

AGCCTCGGAAC TGAAT T CAGAGCAGGCAGAGCACCAGCAAGT GCT GCT T TAC TAAC T GACGTGT T 

GGACCACCCCACCGCTTAATGGGGCACGCACGGTGCATCAAGTCTTGGGCAGGGCATGGAGAAAC 

AAATGTCTTGTTATTATTCTTTGCCATCATGCTTTTTCATATTCCAAAACTTGGAGTTACAAAGA 

TGATCACAAACGTATAGAAT GAGCCAAAAGGCTACATCAT GT T GAGGGT GCT GGAGAT T T TACAT 

TTT GACAAT T GT T T T CAAAATAAAT GT T CGGAAT ACAGTGCAGCCC T TACGACAGGCT TACGTAG 

AGCTTTTGTGAGATTTTTAAGTTGTTATTTCTGATTTGAACTCTGTAACCCTCAGCAAGTTTCAT 

TTT TGTCATGACAATGTAGGAATT GCTGAATTAAAT GT TTAGAAGGATGAAAAATAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAG 
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FIGURE 34 

MRTWLTMKASVIEMFLVLLVTGVHSNKETAKKIKRPKFTVPQINCDVKAGKIIDPEFIVKCPAG 
CQDPKYHVYGTDVYASYSSVCGAAVHSGVLDNSGGKILVRKVAGQSGYKGSYSNGVQSLSLPRWR 
E S FI VLE SKPKKGVT YPSAL TYS S SKS PAAQAGET TKAYQRPP I PGTTAQPVTLMQLLAVTVAVA 
T PT TLPRPS P S AAST T S I PRPQS VGHRSQEMDLWS TAT YTS S QNRPRADPG I QRQDPS GAAFQKP 
VGADVSLGLVPKEELSTQSLEPVSLGDPNCKIDLSFLIDGSTSIGKRRFRIQKQLLADVAQALDI 
GPAGPLMGWQYGDNPATHFNLKTHTNSRDLKTAIEKITQRGGLSNVGRAISFVTKNFFSKANGN 
RSGAPNWWMVDGWPTDKVEEASRLARESGINI FFI T IEGAAENEKQYWEPNFANKAVCRTNG 
FYSLHVQSWFGLHKTLQPLVKRVCDTDRLACSKTCLNSADIGFVIDGSSSVGTGNFRTVLQFVTN 
LTKEFEISDTDTRIGAVQYTYEQRLEFGFDKYSSKPDILNAIKRVGYWSGGTSTGAAINFALEQL 
FKKSKPNKRKLMI L I TDGRS YDDVRI PAMAAHLKGVI TYAI GVAWAAQEELE VI ATHPARDHS FF 
VDE FDNLHQYVPRI IQNICTEFNSQPRN 

Important features: 
Signal peptide: 

amino acids 1-2 6 

Transmembrane domain: 

amino acids 181-200 

N-glycosylation sites. 

amino acids 390-394, 520-524 

N-myristoylation sites. 

amino acids 23-29, 93-99, 115-121, 262-268, 367-373, 389-395, 
431-437, 466-472, 509-515, 570-576, 571-577, 575-581, 627-633 

Amidation site. 

amino acids 304-308 
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FIGURE as 

CCGAGC AC AGGAGATT GCCT GCGTT T AGGAGGTGGCT GCGTTGT GGGAAAAGCT AT CAAGGAAGAAAT TGC 
CAAACC AT GT CT T T TT TTC T GT TTT C AG AGT AGT TCAC AACAGATCT G AGTGTT T T AATT AAGC ATGG AAT 
ACAGAAAAC AAC AAAAAACT TAAGCT TT AATT TCATCT GGAATT CC AC AGTT TTCT T AGCTC CC TGGACC C 
GGTTGACCTGTTGGCTCTTCCCGCTGGCTGCTCTATCACGTGGTGCTCTCCGACTACTCACCCCGAGTGTA 
AAGAACCTTCGGCTCGCGTGCTTCTGAGCTGCTGTGG ATGG CCTCGGCTCTCTGGACTGTCCTTCCGAGTA 
GGATGTCACTGAGATCCCTCAAATGGAGCCTCCTGCTGCTGTCACTCCTGAGTTTCTTTGTGATGTGGTAC 
CT CAGCCT T C CCC ACT AC AAT GTGAT AGAACGC GTG AACT GG AT GT ACTT CT AT GAGT AT GAGC CG AT TT A 
1 5 ? 5 C AGACAAG ACTT T C AC TTC AC ACT TCGAGAGC AT TCAAACTGCT CT CATC AAAATC CAT TTC TGGT C ATT C 

Q TGGTGACCTCCCACCCTTCAGATGTGAAAGCCAGGCAGGCCATTAGAGTTACTTGGGGTGAAAAAAAGTCT 
CJ TGGTGGGGAT AT GAGGT T CT T AC ATT TT TCTT AT T AGGCCAAG AGGCT GAAAAGGAAGAC AAAATGT T GGC 

Cfl" AT TGT CCT T AGAGGAT GAAC ACCT TCTT TATGGT GAC ATAATCCGACAAGAT TTT T TAGACACAT AT AAT A 

$1 ACCTGACCTTGAAAACCATTATGGCATTCAGGTGGGTAACTGAGTTTTGCCCCAATGCCAAGTACGTAATG 
*H AAGACAGAC ACT GATGT T TT CAT C AAT ACT GGCAAT T T AGTGAAGT AT CT TT T AAACCTAAACC ACTC AGA 

Jjt GAAGTTTTTCACAGGTTATCCTCTAATTGATAATTATTCCTATAGAGGATTTTACCAAAAAACCCATATTT 

5 ej 

CTTACCAGGAGTATCCTTTCAAGGTGTTCCCTCCATACTGCAGTGGGTTGGGTTATATAATGTCCAGAGAT 

a; 

TTGGTGCCAAGGATCTATGAAATGATGGGTCACGTAAAACCCATCAAGTTTGAAGATGTTTATGTCGGGAT 
||| CT GTTT GAAT T T AT TAAAAG TGAAC AT T CAT ATT CC AGAAGACACAAAT C TT TT CT TT CT AT AT AGAATC C 

Pf ATTT GGAT GTCT GT CAACTG AG ACGT GT GATT GCAGCC C ATGGC TT TTCT TCCAAGGAGAT CAT C ACT TT T 

||| TGGCAGGTCATGCTAAGGAACACCACATGCCATTATTAACTTCACATTCTACAAAAAGCCTAGAAGGACAG 

asps 

|y| GATACCTT GTGG AAAGTGT T AAAT AAAGT AGGTACTGT GG AAAATT C AT GGGGAGGTC AGT GTGCTGGCT T 

111 AC ACT G AACTGAAACT CAT G AAAAACCC AG AC TG GAGACT GG AGGGTT AC ACTTGT GATTTATT AGT C AGG 

CC CTTC AAAG AT GAT ATGT GGAGGAATT AAAT AT AAAGGAAT TGGAGGT T TT TGCTAAAG AAAT T AAT AGG 
AC C AAACAATTT GGAC AT GT CATT CT GT AG AC T AGAAT TT CT T AAAAGGGTG TT AC TGAGT T AT AAGCTC A 
CT AGGCTGTAAAAACAAAAC AATGT AGAGT T T T ATT T ATT GAAC AATGT AGT CACT TGAAGGTT TTGT GT A 
TATCTTATGTGGATTACCAATTTAAAAATATATGTAGTTCTGTGTCAAAAAACTTCTTCACTGAAGTTATA 
CT GAACAAAATT TT ACCT GT T T TT GGTC AT T TAT AAAGT ACT T CAAGATGTT GCAGT ATTT C AC AGT TAT T 
ATT AT T TAAAAT T ACTTCAACT TT GT GT TT T T AAAT GT TT TG AC GATTTC AATACAAG AT AAAAAGGATAG 
TGAAT CAT TCTT T ACATGC AAACATT TT CC AGT T ACT T AACT GATCAGTT T ATT AT TGAT AC AT CACT CCA 
TT AAT GT AAAGT CAT AGGT C AT TATT GC AT AT C AGT AATCTCT T GG ACT T TGTT AAAT ATT T TACTGT GGT 
AAT AT AGAGAAGAAT T AAAGC AAGAAAATCTGAAAA 
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FIGURE 36 

MASALWTVLPSRMSLRSLKWSLLLLSLLSFFVMWYLSLPHYNVIERWWMYFYEYEPIYRQDFHF 
TLREHSNCSHQNPFLVILVTSHPSDVKARQAIRVTWGEKKSWWGYEVLTFFLLGQEAEKEDKMLA 
LSLEDEHLLYGDIIRQDFLDTYNNLTLKTIMAFRWTEFCPNAKYVMKTDTDVFINTGNLVKYLL 
NLNHSEKFFTGYPLIDNYSYRGFYQKTHISYQEYPFKVFPPYCSGLGYIMSRDLVPRIYEiyiMGHV 
KP IKFEDVYVGICLNLLKVNIHI PEDTNLFFLYRIHLDVCQLRRVIAAHGFS SKE 1 1 TFWQVMLR 
NTTCHY 

Important features: 

Type II transmembrane domain: 

amino acids 20-39 

N-glycosylation sites. 

amino acids 72-76, 154-158, 198-202, 212-216, 326-330 

Glycosaminoglycan attachment site. 

amino acids 239-243 

Ly-6 / u-PAR domain proteins. 

amino acids 23-37 

N-myristoylation site. 

amino acids 271-277 
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FIGURE 37 



CGCTCGGGCACCAGCCGCGGCAAGGATGGAGCTGGGTTGCTGGACGCAGTTGGGGCTCACTTTTCTTCAGCTCCTTCTCATC 
T CGTC C TTGCC AAGAGAGTACACAGTCAT TAAT GAAGCC TGCCC TGGAGCAGAGT GGAATATC ATGTGTCGGGAGTGCT GTG 
AATAT GATCAGATTGAGTGCGTC TGCC C C GGAAAGAGGGAAGTC GTGGGTTATAC C ATCC C TTGC TGCAGGAATGAGGAGAA 
TGAGTGTGACTCCTGCCTGATCCACCCAGGTTGTACCATCTTTGAAAACTGCAAGAGCTGCCGAAATGGCTCATGGGGGGGT 
ACCTTGGAT GAC T TC TATGT GAAGGGGT TCT AC TGT GCAGAGTGCCGAGCAGGCT GGTAC GGAGGAGAC TGCATGC GATGTG 
GCCAGGT TCTGC GAGCCCC AAAGGGTC AGAT T T TGTTGGAAAGC TATC CCC TAAATGCTC AC TGT GAATGGACCAT T CATGC 
T AAAC CT GGGT T T GTC ATCC AAC TAAGAT T TGTCATGTT GAGTC TGGAGTT TGAC T ACAT GT GC CAGTATGAC TATGT T GAG 
GTTCGTGAT GGAGACAACCGCGATGGC C AGATC ATCAAGCGTGT C TGTGGC AACGAGCGGCCAGC TCC TATCCAGAGCATAG 
GATC C TC AC TCCACGTCCT C T TC CAC TC CGATGGC TCCAAGAATT T TGACGGT T TCC ATGCCAT T TATGAGGAGATCACAGC 
ATGCTCCTCATCCCCTTGTTTCCATGACGGCACGTGCGTCCTTGACAAGGCTGGATCTTACAAGTGTGCCTGCTTGGCAGGC 
TATAC TGGGCAGCGCT GTGAAAATC TC C T TGAAGAAAGAAACTGCTC AGAC CC TGGGGGCCCAGT CAAT GGGTACCAGAAAA 
TAACAGGGGGCCCTGGGC TT ATC AACGGACGCCATGC TAAAATTGGCACCGTGGTGTCTTTCT TTTGTAAC AAC TCC TATGT 
TCT TAGTGGCAATGAGAAAAGAACTT GC CAGCAGAATGGAGAGTGGTCAGGGAAACAGCCC ATC TGCAT AAAAGCC TGCCGA 
GAAC C AAAGAT T TC AGACC TGGTGAGAAGGAGAGT TC TTCCGAT GCAGGTTCAGTCAAGGGAGAC ACC ATTACACCAGC TAT 
ACTC AGC GGCC T TCAGC AAGCAGAAAC T GCAGAGT GC CCC TACCAAGAAGCC AGC C CTTC CCT TTGGAGATC TGCCCATGGG 
ATACCAACATCTGCATACCCAGCTCCAGTATGAGTGCATCTCACCCTTCTACCGCCGCCTGGGCAGCAGCAGGAGGACATGT 
CTGAGGACTGGGAAGTGGAGTGGGCGGGCACCATCCTGCATCCCTATCTGCGGGAAAATTGAGAACATCACTGCTCCAAAGA 
CCC AAGGGT TGC GC TGGCC GT GGCAGGC AGCCATC TACAGGAGGACCAGCGGGGTGCATGACGGCAGCCTACACAAGGGAGC 
GTGGTTCCTAGTCTGCAGCGGTGCCCTGGTGAATGAGCGCACTGTGGTGGTGGCTGCCCACTGTGTTACTGACCTGGGGAAG 
GTC ACCATGATC AAGACAGCAGACC TGAAAGTTGT T T TGGGGAAATT C TACCGGGATGATGAC CGGGAT GAGAAGACCATCC 
AGAGC CT AC AGATT TC TGCT ATCAT TC T GCATCCCAACTATGACCCCAT CC TGC TTGAT GC TGAC AT C GCCATCC TGAAGC T 
CC TAGACAAGGCC CGTATCAGCACC CGAGTCC AGC CCAT C TGCC T CGC T GCCAGTCGGGATCTC AGCAC T TCC TTCC AGGAG 
TCC CACATCAC TGTGGC TGGC TGGAAT GTCC TGGCAGAC GTGAGGAGCCCT GGC TTCAAGAACGACACAC TGCGC TCTGGGG 
TGGTCAGTGTGGTGGACTCGCTGCTGTGTGAGGAGCAGCATGAGGACCATGGCATCCCAGTGAGTGTCACTGATAACATGTT 
CTGT GCCAGC TGGGAACCCACTGCC C C T TC TGATATC TGCACTGC AGAGAC AGGAGGCATCGC GGCTGT GTC C T TCCCGGGA 
CGAGC AT CT CC TGAGCCACGC TGGC AT C TGATGGGAC TGGTCAGC TGGAGC TAT GATAAAACATGCAGC CAC AGGCTCTC CA 
C TGCC TT CACCAAGGTGCT GCCT T T TAAAGAC T GGAT TGAAAGAAATATGAA AT GAA CCATGC TCATGCACTCC T TGAGAAG 
TGTTTCTGTATATCCGTCTGTACGTGTGTCATTGCGTGAAGCAGTGTGGGCCTGAAGTGTGATTTGGCCTGTGAACTTGGCT 
GTGCC AGGGCT T CTGAC TT CAGGGACAAAAC TC AGTGAAGGGT GAG T AGACC TCC AT TGC TGGT AGGC TGATGCCGC GTC CA 
C TAC TAGGACAGCCAAT TGGAAGATGC C AGGGC TT GC AAGAAGTAAGT T TC T TCAAAGAAGACCATATACAAAACCTCTCC A 
C TC C ACTGACC T GGTGGTC TTCCCCAAC TT TCAGT TATAC GAAT GCC ATCAGC T T GACCAGGGAAGATC TGGGCT TCAT GAG 
GCC CC TT T TGAGGC TC TCAAGTT C TAGAGAGC TGC CTGT GGGACAGCCCAGGGCAGCAGAGCT GGGAT GTGGTGC ATGC C T T 
TGT GT AC AT GGCC ACAGTAC AGTC T GGT CC T T T TCC T TC C CCAT C TC TTGT ACAC AT TT TAATAAAATAAGGGTTGGCT T C T 
GAAC TAC AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



FIGURE 38 



MELGCWTQLGLTFLQLLLISSLPREYTVINEACPGAEWNIMCRECCEYDQIECVCPGKREWGYT 
IPCCRNEENECDSCLIHPGCTIFENCKSCRNGSWGGTLDDFYVKGFYCAECRAGWYGGDCMRCGQ 
VLRAPKGQI LLES YPLNAHCEWT IHAKPGFVI QLRFVMLSLE FDYMCQYDYVEVRDGDNRDGQI I 
KRVCGNERPAPIQSIGSSLHVLFHSDGSKNFDGFHAIYEEITACSSSPCFHDGTCVLDKAGSYKC 
ACLAGYTGQRCENLLEERNCSDPGGPWGYQKITGGPGLINGRHAKIGTWSFFCNNSYVLSGNE 
KRTCQQNGEWSGKQPICIKACREPKISDLVRRRVLPMQVQSRETPLHQLYSAAFSKQKLQSAPTK 
KPALPFGDLPMGYQHLHTQLQYECISPFYRRLGSSRRTCLRTGKWSGRAPSCIPICGKIENITAP 
KTQGLRWPWQAAIYRRTSGVHDGSLHKGAWFLVCSGALVNERTVVVAAHCVTDLGKVTMIKTADL 
KWLGKFYRDDDRDEKT I QSLQ I S AI I LHPNYDP I LLDADI AI LKLLDKARI S TRVQP ICLAASR 
DLSTSFQESHITVAGWNVLADTOSPGFKNDTLRSGWSWDSLLCEEQHEDHGIPVSVTDNMFCA 
S WEPTAPSD I CTAE TGG IAAVS FPGRAS PEPRWHLMGLVS WS YDKTCSHRL S TAFTKVLPFKDWI 
ERNMK 

Important features of the protein: 
Signal peptide: 

amino acids 1-23 

EGF-like domain cysteine pattern signature, 
amino acids 260-272 
N-glycosylation sites . 

amino acids 96-100, 279-283, 316-320, 451-455, 614-618 
N-myristoylation sites. 

amino acids 35-41, 97-103, 256-262, 284-290, 298-304, 308-314, 

474-480, 491-497, 638-644, 666-672 

Amidation site. 

amino acids 56-60 

Serine proteases, trypsin family. 

amino acids 489-506 

CUB domain proteins profile. 

amino acids 150-167 
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FIGURE 3Q 



m 



GGTT CCTACAT C CT CT CAT CT GAGAAT CAGAGAGCATAAT CTTCT TACGGG C CCGT GATT TATTAACGT GGCTTAATC 
T GAAG GTTCTCAGT CAAATT CT T T GT GAT CTACT GATTGT G GGGGCATGGCAAGGTT TG CTTAAAGGAGCTT GGCT GG 
TTTGGGCCCTTGTAGCTGACAGAAGGTGGCCAGGGAGAATGCAGCACACTGCTCGGAG AATGA AGGCGCTTCTGTTGC 
T GGT CTT GCCTT GGCT CAGT C CT GCTAACTACATT GACAAT GT GGGCAACCT GCACTTCC TGTAT TCAGAACTCT GTA 
AAGGT GCCT CC CACTACGGCCT GAC CAAAGATAGGAAGAG GCGCTCACAAGAT GG CT GT CCAGACGGCT GT GCGAGCC 
TCACAGCCACGGCTCCCTCCCCAGAGGTTTCTGCAGCTGCCACCATCTCCTTAATGACAGACGAGCCTGGCCTAGACA 
ACCCT GCCTAC GTGT CCT CG GCAGAGGAC GGGCAGCCAGCAAT CAGC CCAGT GGACT CT GGC CGGAGCAAC CGAACTA 
GGGCACG GCCCT TT GAGAGAT CCACTATTAGAAGCAGAT CATTTAAAAAAAT AAAT CGAGCTTT GAGTGTT CTT C GAA 
GGACAAAGAGC GGGAGT GCAGTT G C CAAC C ATGCCGACCAGGGCAGGGAAAATT CT GAAAACAC CACTGCC CCTGAAG 
TCTTTCC^GGTTGTACCACCTGATTCCAGATGGTGAAATTACCAGCATC^GATCAATCGAGTAGATCCC^lGTGAAA 
GCCT CTCTATTAGGCT GGT GGGAG GTAGC GAAACCCCACT GGTC CATAT CATTAT CCAACACATTTATCGT GATGGGG 

tgatcgccagagacggccggctactgccaggagacatcattctaaaggtcaacgggatggacatcagcaatgtccctc 
W acaactacgctgtgcgtctcctgcggcagccctgccaggtgctgtggctgactgtgatgcgtgaacagaagttccgca 
gcaggaacaatggacaggccccggatgcctacagaccccgagatgacagctttcatgtgattctcaacaaaagtagcc 
\& ccgaggagcagcttggaataaaactggtgcgcaaggtggatgagcctggggttttcatcttcaatgtgctggatggcg 
?i# gtgtggcatatcgacatggtcagcttgaggagaatgaccgtgtgttagccatcaatggacatgatcttcgatatggca 
5i gcccagaaagtgcggctcatctgattcaggccagtgaaagacgtgttcacctcgtcgtgtcccgccaggttcggcagc 
Wl ggagccctgacatctttcaggaagccggctggaacagcaatggcagctggtccccagggccaggggagaggagcaaca 

m 

%f» ctcccaagcccctccatcctacaattacttgtcatgagaaggtggtaaatatccaaaaagaccccggtgaatctctcg 
M gcatgaccgtcgcagggggagcatcacatagagaatgggatttgcctatctatgtcatcagtgttgagcccggaggag 
III tcataagcagagatggaagaataaaaacaggtgacattttgttgaatgtggatggggtcgaactgacagaggtcagcc 
Ul ggagtgaggcagtggcattattgaaaagaacatcatcctcgatagtactcaaagctttggaagtcaaagagtatgagc 
1 f* cccaggaagactgcagcagcccagcagccctggactccaaccacaacatggccccacccagtgactggtccccatcct 
gggtcatgtggctggaattaccacggtgcttgtataactgtaaagatattgtattacgaagaaacacagctggaagtc 
tgggcttctgcattgtaggaggttatgaagaatacaatggaaacaaaccttttttcatcaaatccattgttgaaggaa 
caccagcatacaatgatggaagaattagatgtggtgatattcttcttgctgtcaatggtagaagtacatcaggaatga 
tacatgcttgcttggcaagactgctgaaagaacttaaaggaagaattactctaactattgtttcttggcctggcactt 
tt tt ataga atcaat gatgggt cagaggaaaacagaaaaat cacaaataggctaagaagt t gaaacac tatatt tat c 
ttgtc^gtttttatatttaaagaaagaatacattgtaaaaatgtcaggaaaagtatgatqatctaatg 

aacattgtttatattttttctattcaataaaaagccctaaaacaactaaaatgattgatttgtataccccactgaatt 
caagctgatttaaatttaaaatttggtatatgctgaagtctgccaagggtacattatggccatttttaatttacagct 
aaaatattttttaaaat gc ATT gct gagaaacgtt gctt t cat caaacaagaat aaatat tt tt cagaagttaaa 
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FIGURE 40 

MKALLLLVLPWLSPANYIDNVGNLHFLYSELCKGASHYGLTKDRKRRSQDGCPDGCASLTATAPS 
PEVSAAAT I S LMTDEPGLDNPAYVS SAEDGQPAI S PVDS GRSNRTRARPFERS T IRSRS FKKINR 
ALSVLRRTKSGSAVANHADQGRENSENTTAPEVFPRLYHLIPDGEITSIKINRVDPSESLSIRLV 
GGSETPLVHIIIQHIYRDGVIARDGRLLPGDIILKVNGMDISNVPHNYAVRLLRQPCQVLWLTVM 
REQKFRSRNNGQAPDAYRPRDDSFHVILNKSSPEEQLGIKLVRKVDEPGVFIFNVLDGGVAYRHG 
QLEENDRVLAINGHDLRYGSPESAAHLIQASERRVHLWSRQVRQRSPDIFQEAGWNSNGSWSPG 
PGERSNT PKPLHPT I TCHEKWNI QKDPGE S LGMTVAGGAS HREWDLP I YVI S VE PGGVI SRDGR 
IKTGDILLNVDGVELTEVSRSEAVALLKRTSSSIVLKALEVKEYEPQEDCSSPAALDSNHNMAPP 
SDWSPSWVMWLELPRCLYNCKDIVLRRNTAGSLGFCIVGGYEEYNGNKPFFIKSIVEGTPAYNDG 
RIRCGDILLAWGRSTSGMIHACLARLLKELKGRITLTIVSWPGTFL 

Important features: 
Signal peptide: 

amino acids 1-15 

N-glycosylation sites. 

amino acids 108-112, 157-161, 289-293, 384-388 

Tyrosine kinase phosphorylation sites. 

amino acids 433-441, 492-500 

N-myristoylation sites. 

amino acids 51-57, 141-147, 233-239, 344-350, 423-429, 447-453, 
467-473, 603-609 
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FIGURE Ai 



ACCAG GC AT TGT ATC T TCAG T TGT CAT CAAGT T CG CAAT CAGAT T GGAAAAG CTCAACT T GAAGC T T T 
CTT GCCT GCAGT GAAGCAGAGAGATAGATATTATT CACGTAATAARAAAC ATG GGCTTCAACCTGACT 
TTCCACCTTTCCTACAAATTCCGATTACTGTTGCTGTTGACTTTGTGCCTGACAGTGGTTGGGTGGGC 
CACCAGTAACTACTTCGTGGGTGCCATTCAAGAGATTCCTAAAGCAAAGGAGTTCATGGCTAATTTCC 
ATAAGACCCTCAT TT TGGGGAAGGGAAAAACTCTGACTAAT GAAGCAT CCACGAAGAAGGTAGAACT T 
GACAACTGTCCTTCTGTGTCTCCTTACCTCAGAGGCCAGAGCAAGCTCATTTTCAAACCAGATCTCAC 
TTTGGAAGAGGTACAGGCAGAAAATCCCAAAGTGTCCAGAGGCCGGTATCGCCCTCAGGAATGTAAAG 
CTTTACAGAGGGTCGCCATCCTCGTTCCCCACCGGAACAGAGAGAAACACCTGATGTACCTGCTGGAA 
CAT CT GCAT CCCT TCC TGCAGAGGCAGCAGCT GGATTAT GGCAT CTACGTCATCCACCAGGCTGAAGG 
m TAAAAAGT T TAAT CGAGCCAAACTCTTGAAT GTGGGCTAT CTAGAAGCCCT CAAGGAAGAAAATT GGG 

||| ACT GCTTTATATT CCACGATGTGGACCTGGT ACCCGAGAAT GACT TTAACCT TTACAAGT GTGAGGAG 

111 CATCCCAAGCATCTGGTGGTTGGCAGGAACAGCACTGGGTACAGGTTACGTTACAGTGGATATTTTGG 
GGGTGTTACTGCCCTAAGCAGAGAGCAGTTTTTCAAGGTGAATGGATTCTCTAACAACTACTGGGGAT 
GGGGAGGCGAAGACGATGACCTCAGACTCAGGGTTGAGCTCCAAAGAATGAAAATTTCCCGGCCCCTG 
J! ¥| CCT GAAGT GGGTAAAT ATACAATGGTCTT CCACACTAGAGACAAAGGCAATGAGGT GAACGCAGAACG 

sK GAT GAAGCT CTTACAC CAAGT GTCACGAGTCT GGAGAACAGATGGGTT GAGTAGT TGTTCTTATAAAT 

g| T AGTAT C T G T GGAACAC AAT C C T T TAT AT AT CAACAT CAC AGT GGAT TTCTGGTTTGGT GC ATGACC C 

||| TGGATCTTTTGGTGATGTTTGGAAGAACTGATTCTTTGTTTGCAATAATTTTGGCCTAGAGACTTCAA 
|*| ATAGTAGCACACATTAAGAACCTGT TACAGCT CATT GTT GAGCT GAAT T TT T CCT TTTTGTATT TTCT 



111 



m 



TAG CAGAG C T C C T GGT GAT GTAGAG TAT AAAAC AGT T GT AACAAGAC AG C T T TCT T AGT CAT T T T GAT 
CAT GAGGGT TAAATAT T GT AATAT G GAT ACT T GAAGGAC T T TAT ATAAAAGGAT GAC T C AAAGGATAA 
AAT GAACGCTAT T TGAGGACTCTGGTTGAAGGAGAT T TATTTAAATTT GAAGTAAT ATATTAT GGGAT 
AAAAG GC CAC AGGAAAT AAGAC T GC T GAATGT C T GAGAGAAC CAGAGT T GT T CT CG T CC AAGG TAGAA 
AGGTACGAAGATACAATACTGTTATTCATTTATCCTGTACAATCATCTGTGAAGTGGTGGTGTCAGGT 
GAGAAGGCG TCC ACAAAAGAG GGGAGAAAAG G C GAC GAAT CAGGACAC AGT GAACT T GG GAAT GAAGA 
GGTAGCAGGAGGGTGGAGTGTCGGCTGCAAAGGCAGCAGTAGCT GAGCT GGT TGCAGGTGCTGATAGC 
CTTCAGGGGAGGACCTGCCCAGGTATGCCTTCCAGTGATGCCCACCAGAGAATACATTCTCTATTAGT 
TTT TAAAGAGTT T TT GTAAAATGATTTT GTACAAGTAGGATAT GAAT TAGC AGT T TACAAGTTTACAT 
ATTAACT AATAATAAATAT GTCTAT CAAATACCTCT GTAGTAAAATGT GAAAAAGCAAAA 



APP ID=1 0063592 



Page 191 of 318 



3. - 
5 S3 



FIGURE 42 



MGFNLTFHLSYKFRLLLLLTLCLTWGWATSNYFVGAIQEIPKAKEfMANFHKTLILGKGKTLTN 
EASTKKVKLDNCPSVSPYLRGQSKLIFKPDLTLEEVQAENPKVSRGRYRPQECKALQRVAILVPH 
RNREKHLMYLLEHLHPFLQRQQLDYGIYVIHQAEGKKFNRAKLLNVGYLEALKEENWDCFIFHDV 
DLVPENDFNLYKCEEHPKHLWGRNSTGYRLRYSGYFGGVTALSREQFFKVNGFSNNYWGWGGED 
DDLRLRVELQRMKISRPLPEVGPCYTMVFHTRDKGNEWAERMKLLHQVSRVWRTDGLSSCSYKLV 
SVEHNPLYINITVDFWFGA 



f*\ Important features: 

O Signal peptide: 

amino acids 1-27 



N-glycosylation sites. 

amino acids 4-8, 220-224, 335-339 

Xylose isomerase proteins . 

amino acids 191-202 
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FIGURE ax 

GCTCAAGACCCAGCAGTGGGACAGCCAGACAGACGGCACGATGGCACTGAGCTCCCAGATCTGGG 

CCGCTTGCCTCCTGCTCCTCCTCCTCCTCGCCAGCCTGACCAGTGGCTCTGTTTTCCCACAACAG 

ACGGGACAACTTGCAGAGCTGCAACCCCAGGACAGAGCTGGAGCCAGGGCCAGCTGGATGCCCAT 

GTTCCAGAGGCGAAGGAGGCGAGACACCCACTTCCCCATCTGCATTTTCTGCTGCGGCTGCTGTC 

ATCGATCAAAGTGTGGGATGTGCTGCAAGACG TAGA ACCTACCTGCCCTGCCCCCGTCCCCTCCC 

TTCCTTATTTATTCCTGCTGCCCCAGAACATAGGTCTTGGAATAAAATGGCTGGTTCTTTTGTTT 

TCCAAAAA?U^AAAAAAAAAAAAAAA&AAAAAAAAA&A^^ 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE aa 

MALSSQIWAACLLLLLLLASLTSGSVFPQQTGQLAELQPQDRAGARASWMPMFQRRRRRDTHFPI 
C I FCCGCCHRSKCGMCCKT 



Important features: 
Signal peptide: 

amino acids 1-24 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 58-59 

N-myristoylation site, 
amino acids 44-50 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 1-12 
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FIGURE as 



m 



GTGGCTTCATTTCAGTGGCTGACTTCCAGAGAGCAATATGGCTGGTTCCCCAACATGCCTCACCC 
TCATCTATATCCTTTGGCAGCTCACAGGGTCAGCAGCCTCTGGACCCGTGAAAGAGCTGGTCGGT 
TCCGTTGGTGGGGCCGTGACTTTCCCCCTGAAGTCCAAAGTAAAGCAAGTTGACTCTATTGTCTG 
GACCTTCAACACAACCCCTCTTGTCACCATACAGCCAGAAGGGGGCACTATCATAGTGACCCAAA 
AT CGTAATAGGGAGAGAGTAGACT T CCCAGAT GGAGGC TACTCCC T GAAGCTGAGCAAAC TGAAG 
AAGAATGAC TCAGGGAT C TAC TAT GTGGGGATATACAGC TCATCACTCCAGCAGCCC TCCACC CA 
GGAGTACGTGCTGCATGTCTACGAGCACCTGTCAAAGCCTAAAGTCACCATGGGTCTGCAGAGCA 
ATAAGAATGGCACCTGTGTGACCAATCTGACATGCTGCATGGAACATGGGGAAGAGGATGTGATT 
TATACCTGGAAGGCCCTGGGGCAAGCAGCCAATGAGTCCCATAATGGGTCCATCCTCCCCATCTC 
CT GGAGATGGGGAGAAAGTGATAT GACC T T CATC T GCGTTGC CAGGAACCC TGT CAGCAGAAAC T 
TCTCAAGCCCCATCCTTGCCAGGAAGCTCTGTGAAGGTGCTGCTGATGACCCAGATTCCTCCATG 



,n GTCCTCCTGTGTCTCCTGTTGGTGCCCCTCCTGCTCAGTCTCTTTGTACTGGGGCTATTTCTTTG 
ffj GT TTCTGAAGAGAGAGAGACAAGAAGAGTACATT GAAGAGAAGAAGAGAGTGGACATTTGTCGGG 

? AAACTCCTAACATATGCCCCCATTCTGGAGAGAACACAGAGTACGACACAATCCCTCACACTAAT 

*** AGAACAATCC T AAAGGAAGATCCAGGAAATACGGT T TAC TCCACT GT GGAAATACCGAAAAAGAT 

if? 

GGAAAAT CCCCAC TCAC TGC TCACGATGCCAGACACACCAAGGCTAT T TGCC TAT GAGAATGT TA 
TCTAGACAGCAGTGCACTCCCCTAAGTCTCTGCTCA 



0 
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r: S 



FIGURE A6 



0 Signal peptide: 



MAGSPTCLTLIYILWQLTGSAASGPVKELVGSVGGAVTFPLKSKVKQVDSIVWTFNTTPLVTIQP 
EGGTIIVTQNRNRERVDFPDGGYSLKLSKLKKNDSGIYYVGIYSSSLQQPSTQEYVLHVYEHLSK 
PKVTMGLQSNKNGTCVTNLTCCMEHGEEDVIYTWKALGQAANESHNGSILPISWRWGESDMTFIC 
VARNPVSRNFSSPILARKLCEGAADDPDSSMVLLCLLLVPLLLSLFVLGLFLWFLKRERQEEYIE 
EKKRVDICRETPNICPHSGENTEYDTIPHTNRTILKEDPANTVYSTVEIPKKMENPHSLLTMPDT 
PRLFAYENVI 

Important features: 



amino acids 1-22 



Transmembrane domain: 



fit amino acids 224-250 

m 

si 

13 Leucine zipper pattern. 

W amino acids 229-251 

**% N-glycosylation sites. 

ffj amino acids 98-102, 142-146, 148-152, 172-176, 176-180, 204-208, 
291-295 
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FIGURE 47 



GGCTCGAGCGTTTCTGAGCCAGGGGTGACC ATGA CCTGCTGCGAAGGATGGACATCCTGCAATGG 
ATTCAGCCTGCTGGTTCTACTGCTGTTAGGAGTAGTTCTCAATGCGATACCTCTAATTGTCAGCT 
TAGTTGAGGAAGACCAATTTTCTCAAAACCCCATCTCTTGCTTTGAGTGGTGGTTCCCAGGAATT 
ATAGGAGCAGGTCTGATGGCCAT T CCAGCAACAACAAT G TCC T TGACAGCAAGAAAAAGAGCGT G 
CTGCAACAACAGAACTGGAATGTTTCTTTCATCATTTTTCAGTGTGATCACAGTCATTGGTGCTC 
TGTATTGCATGCTGATATCCATCCAGGCTCTCTTAAAAGGTCCTCTCATGTGTAATTCTCCAAGC 
AACAGTAATGCCAATTGTGAATTTTCATTGAAAAACATCAGTGACATTCATCCAGAATCCTTCAA 
L CTTGCAGTGGTTTTTCAATGACTCTTGTGCACCTCCTACTGGTTTCAATAAACCCACCAGTAACG 
£j| ACACCAT GGCGAGT GGC TGGAGAGCAT CTAGT T T CCAC T TCGAT T CT GAAGAAAACAAACATAGG 

p CTTATCCACTTCTCAGTATTTTTAGGTCTATTGCTTGTTGGAATTCTGGAGGTCCTGTTTGGGCT 
W CAGTCAGATAGTCATCGGTTTCCTTGGCTGTCTGTGTGGAGTCTCTAAGCGAAGAAGTCAAATTG 
^ T G TAGT T TAATGGGAATAAAATGTAAGTATCAGTAGT T TGAAAAAAAAAAA 

m 

- sj 
If! 



iff 
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FIGURE 48 



MTCCEGWTSCNGFSLLVLLLLGWLNAI PL I VSLVEEDQFSQNPI SCFEWWFPGI IGAGLMAI PA 
T TMS L TARKRACCNNRTGMFL S S FFSVI T VI GALYCML I S I QALLKGPLMCNS PSNSNANCE FS L 
KNISDIHPESFNLQWFFNDSCAPPTGFNKPTSNDTMASGWRASSFHFDSEENKHRLIHFSVFLGL 
LLVGILEVLFGLSQIVIGFLGCLCGVSKRRSQIV 

Important features: 
Transmembrane domains: 

amino acids 10-31 (type II), 50-72, 87-110, 191-213 
N-glycosylation sites . 

amino acids 80-84, 132-136, 148-152, 163-167 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 223-227 

N-myristoylation sites. 

amino acids 22-28, 54-60, 83-89, 97-103, 216-222 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 207-218 

TNFR/NGFR family cysteine-rich region protein. 

amino acids 4-12 
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FIGURE AQ 

ATCCGTTCTCTGCGCTGCCAGCTCAGGTGAGCCCTCGCCAAGGTGACCTCGCAGGACACTGGTGA 
AGGAGCAGTGAGGAACCTGCAGAGTCACACAGTTGCTGACCAATTGAGCTGTGAGCCTGGAGCAG 
ATCCGTGGGCTGCAGACCCCCGCCCCAGTGCCTCTCCCCCTGCAGCCCTGCCCCTCGAACTGTGA 
CATGGAGAGAGTGACCCTGGCCCTTCTCCTACTGGCAGGCCTGACTGCCTTGGAAGCCAATGACC 
CATTTGCCAATAAAGACGATCCCTTCTACTATGACTGGAAAAACCTGCAGCTGAGCGGACTGATC 
TGCGGAGGGCTCCTGGCCATTGCTGGGATCGCGGCAGTTCTGAGTGGCAAATGC7\AATACAAGAG 
CAGCCAGAAGCAGCACAGTCCTGTACCTGAGAAGGCCATCCCACTCATCACTCCAGGCTCTGCCA 
CTACTTGCTGAGCACAGGACTGGCCTCCAGGGATGGCCTGAAGCCTAACACTGGCCCCCAGCACC 
TCCTCCCCTGGGAGGCCTTATCCTCAAGGAAGGACTTCTCTCCAAGGGCAGGCTGTTAGGCCCCT 
TTCTGATCAGGAGGCTTCTTTATGAATTAAACTCGCCCCACCACCCCCTCA 
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FIGURE so 

MERVTLALLLLAGLTALEA^ 
SQKQHS PVPEKAI PLI T PGSATTC 



II! 



3 ; ir 



Important features: 
Signal peptide: 

amino acids 1-16 

Transmembrane domain: 

amino acids 36-59 

N-myristoylation sites, 

amino acids 41-47, 45-51, 84-90 

Extracellular proteins SCP/Tpx-l/Ag5/PR-l/Sc7 . 

amino acids 54-67 



APP_ID=10063592 Page 200 of 318 



FIGURE 51 



GT GGAC TC TGAGAAGCCCAGGCAGT T GAGGACAGGAGAGAGAAGGC TGCAGACCCAGAGGGAGGG 
AGGACAGGGAGTCGGAAGGAGGAGGACAGAGGAGGGCACAGAGACGCAGAGCAAGGGCGGCAAGG 
AGGAGACCC TGGT GGGAGGAAGACACT C TGGAGAGAGAGGGGGC TGGGCAGAGATGAAG T TCCAG 
GGGCCCCTGGCCTGCCTCCTGCTGGCCCTCTGCCTGGGCAGTGGGGAGGCTGGCCCCCTGCAGAG 
CGGAGAGGAAAGCACTGGGACAAATATTGGGGAGGCCCT TGGACAT GGCCTGGGAGACGCCCTGA 
GCGAAGGGGTGGGAAAGGCCATTGGCAAAGAGGCCGGAGGGGCAGCTGGCTCTAAAGTCAGTGAG 
GCCCTTGGCCAAGGGACCAGAGAAGCAGTTGGCACTGGAGTCAGGCAGGTTCCAGGCTTTGGCGC 
J n AGCAGATGCTTTGGGCAACAGGGTCGGGGAAGCAGCCCATGCTCTGGGAAACACTGGGCACGAGA 
f 1 TTGGCAGACAGGCAGAAGATGTCATTCGACACGGAGCAGATGCTGTCCGCGGCTCCTGGCAGGGG 
01 GTGCCTGGCCACAGTGGTGCTTGGGAAACTTCTGGAGGCCATGGCATCTTTGGCTCTCAAGGTGG 
ill CCTTGGAGGCCAGGGCCAGGGCAATCCTGGAGGTCTGGGGACTCCGTGGGTCCACGGATACCCCG 

? f f 

GAAACTCAGCAGGCAGC TT TGGAAT GAATCC TCAGGGAGC TCCC T GGGGT CAAGGAGGCAAT GGA 

Vzi4 

^ GGGCCACCAAACTTTGGGACCAACACTCAGGGAGCTGTGGCCCAGCCTGGCTATGGTTCAGTGAG 

^ AGCCAGCAACCAGAATGAAGGGTGCACGAATCCCCCACCATCTGGCTCAGGTGGAGGCTCCAGCA 

|3 ACTCTGGGGGAGGCAGCGGCTCACAGTCGGGCAGCAGTGGCAGTGGCAGCAATGGTGACAACAAC 

W AATGGCAGCAGCAGTGGTGGCAGCAGCAGTGGCAGCAGCAGTGGCAGCAGCAGTGGCGGCAGCAG 

^ . TGGCGGCAGCAGTGGTGGCAGGAGTGGCAACAGTGGTGGCAGCAGAGGTGACAGCGGCAGTGAGT 

55 CCTCCTGGGGATCCAGCACCGGCTCCTCCTCCGGCAACCACGGTGGGAGCGGCGGAGGAAATGGA 

*W 

rp CATAAACCCGGGTGTGAAAAGCCAGGGAATGAAGCCCGCGGGAGCGGGGAATCTGGGATTCAGGG 
CTTCAGAGGACAGGGAGT T TCCAGCAACAT GAGGGAAATAAGCAAAGAGGGCAAT CGCC TCC T TG 
GAGGCTCTGGAGACAATTATCGGGGGCAAGGGTCGAGCTGGGGCAGTGGAGGAGGTGACGCTGTT 
GGTGGAGTCAATACTGTGAACTCTGAGACGTCTCCTGGGATGTTTAACTTTGACACTTTCTGGAA 
GAATTTTAAATCCAAGCTGGGTTTCATCAACTGGGATGCCATAAACAAGGACCAGAGAAGCTCTC 
GCATCCCGTGACCTCCAGACAAGGAGCCACCAGAT TGGATGGGAGCCCCCACACTCCCTCCT TAA 
AACACCACCCTCTCATCACTAATCTCAGCCCTTGCCCTTGAAATAAACCTTAGCTGCCCCACAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^ 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 52 



MKFQGPLACLLLALCLGSGEAGPLQSGEESTGTNIGEALGHGLGDALSEGVGKAIGKEAGGAAGSKVS 
EALGQGTREAVGTGVRQVPGFGAADALGNRVGEAAHALGNTGHEIGRQAEDVI RHGADAVRGSWQGVP 
GHSGAWETSGGHGIFGSQGGLGGQGQGNPGGLGTPWVHGYPGNSAGSFGMNPQGAPWGQGGNGGPPNF 
GTNTQGAVAQPGYGSVRASNQNEGCTNPPPSGSGGGSSNSGGGSGSQSGSSGSGSNGDNNNGSSSGGS 
SSGSSSGSSSGGSSGGSSGGSSGNSGGSRGDSGSESSWGSSTGSSSGNHGGSGGGNGHKPGCEKPGNE 
ARGSGESGIQGFRGQGVSSNMREISKEGNRLLGGSGDNYRGQGSSWGSGGGDAVGGVOTVNSETSPGM 
FNFDTFWKNFKSKLGFINWDAINKDQRSSRIP 

Signal peptide: 



f$% amino acids 1-21 



N-glycosylation site . 

\\& amino acids 265-269 

m 



Glycosaminoglycan attachment site. 

amino acids 235-239, 237-241, 244-248, 255-259, 324-328, 388-392 



Casein kinase II phosphorylation site. 

0 amino acids 26-30, 109-113, 259-263, 300-304, 304-308 



N-myristoylation site. 

amino acids 17-23, 32-38, 42-48, 50-56, 60-66, 61-67, 64-70, 74-80, 

90-96, 96-102, 130-136, 140-146, 149-155, 152-158, 155-161, 

159-165, 163-169, 178-184, 190-196, 194-200, 199-205, 218-224, 

236-242, 238-244, 239-245, 240-246, 245-251, 246-252, 249-252, 

253-259, 256-262, 266-272, 270-276, 271-277, 275-281, 279-285, 

283-289, 284-290, 287-293, 288-294, 291-297, 292-298, 295-301, 

298-304, 305-311, 311-317, 315-321, 319-325, 322-328, 323-329, 

325-331, 343-349, 354-360, 356-362, 374-380, 381-387, 383-389, 
387-393, 389-395, 395-401 

Cell attachment sequence. 

amino acids 301-304 
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FIGURE 53 

GGAGAAGAGGTTGTGTGGGACAAGCTGCTCCCGACAGAAGGATOTCGCTGCTGAGCCTGCCCTGG 
CTGGGCCTCAGACCGGTGGCAATGTCCCCATGGCTACTCCTGCTGCTGGTTGTGGGCTCCTGGCT 
ACTCGCCCGCATCCTGGCTTGGACCTATGCCTTCTATAACAACTGCCGCCGGCTCCAGTGTTTCC 
CACAGCCCCCAAT^ACGGAACTGGTTTTGGGGTCACCTGGGCCTGATCACTCCTACAGAGGAGGGC 
TTGAAGGACTCGACCCAGATGTCGGCCACCTATTCCCAGGGCTTTACGGTATGGCTGGGTCCCAT 
CATCCCCTTCATCGTTTTATGCCACCCTGACACCATCCGGTCTATCACCAATGCCTCAGCTGCCA 
TTGCACCCAAGGATAATCTCTTCATCAGGTTCCTGAAGCCCTGGCTGGGAGAAGGGATACTGCTG 
AGTGGCGGTGACAAGTGGAGCCGCCACCGTCGGATGCTGACGCCCGCCTTCCATTTCAACATCCT 
GAAGT CC TATATAAC GATC T TCAACAAGAGTGCAAACAT CATGCT T GACAAG T GGCAGCACCTGG 
CCTCAGAGGGCAGCAGTCGTCTGGACATGTTTGAGCACATCAGCCTCATGACCTTGGACAGTCTA 
CAGAAAT GCAT CT TCAGCT T T GACAGCCATTGT CAGGAGAGGCCCAGT GAATATAT T GCCACCAT 
CT T GGAGC T CAGT GCCC T TGTAGAGAAAAGAAGCCAGCATATCCT CCAGCACAT GGACT T T CT GT 
ATTACCTCTCCCATGACGGGCGGCGCTTCCACAGGGCCTGCCGCCTGGTGCATGACTTCACAGAC 
GCTGTCATGCGGGAGCGGCGTCGCACCCTCCCCACTCAGGGTATTGATGATTTTTTCAAAGACAA 
AGCCAAGTCCAAGACT T TGGATTT CAT TGATGTGCT TC TGCT GAGCAAGGAT GAAGAT GGGAAGG 
CAT TGT CAGATGAGGATATAAGAGCAGAGGC T GACACC T T CATGT T T GGAGGCCAT GACACCACG 
GCCAGTGGCCTCTCCTGGGTCCTGTACAACCTTGCGAGGCACCCAGAATACCAGGAGCGCTGCCG 
ACAGGAGGTGCAAGAGCT TCTGAAGGACCGCGATCCTAAAGAGATT GAATGGGACGACCTGGCCC 
AGCTGCCCTTCCTGACCATGTGCGTGAAGGAGAGCCTGAGGTTACATCCCCCAGCTCCCTTCATC 
TCCCGATGCTGCACCCAGGACATTGTTCTCCCAGATGGCCGAGTCATCCCCAAAGGCATTACCTG 
CCTCATCGATATTATAGGGGTCCATCACAACCCAACTGTGTGGCCGGATCCTGAGGTCTACGACC 
CCTTCCGCTTTGACCCAGAGAACAGCAAGGGGAGGTCACCTCTGGCTTTTATTCCTTTCTCCGCA 
GGGCCCAGGAACTGCATCGGGCAGGCGTTCGCCATGGCGGAGATGAAAGTGGTCCTGGCGTTGAT 
GCTGCTGCACTTCCGGTTCCTGCCAGACCACACTGAGCCCCGCAGGAAGCTGGAATTGATCATGC 
' GCGCCGAGGGCGGGCTTTGGCTGCGGGTGGAGCCCCTGAATGTAGGCTTGCAGTGACTTTCTGAC 
CCAT CCACC TGTT T T T T T GCAGAT T GT CATGAATAAAACGGTGCT G T CAAA 
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FIGURE *A 

MSLLSLPWLGLRPVAMS PWLLLLLWGSWLLARILAWTYAFYNNCRRLQCFPQPPKRNWFWGHLG 
LITPTEEGLKDSTQMSATYSQGFTVWLGPIIPFIVLCHPDTIRSITNASAAIAPKDNLFIRFLKP 
WLGEGILLSGGDKWSRHRRMLTPAFHFNILKSYITIFNKSANIMLDKWQHLASEGSSRLDMFEHI 
SLMTLDSLQKCIFSFDSHCQERPSEYIATILELSALVEKRSQHILQHMDFLYYLSHDGRRFHRAC 
RLVHDFTDAVIRERRRTLPTQGIDDFFKDKAKSKTLDFIDVLLLSKDEDGKALSDEDIRAEADTF 
MFGGHDTTASGLS WLYNLARHPEYQERCRQEVQELLKDRDPKE I EWDDLAQLP FLTMCVKESLR 
LHPPAPFISRCCTQDIVLPDGRVIPKGITCLIDIIGVHHNPTVWPDPEVYDPFRFDPENSKGRSP 
lAFIPFSAGPRNCIGQAFAMAEMKWIiALMLLHFRFLPDHTEPRRKLELIMRAEGGLWLRVEPLN 
VGLQ 

Important features: 
Transmembrane domains: 

amino acids 13-32 {type II), 77-102 

Cytochrome P450 cysteine heme-iron ligand signature. 

amino acids 461-471 

N-glycosylation sites . 

amino acids 112-116, 168-172 
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FIGURE ss 



in 



in 
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AT C GCATCAAT T GGGAGTACCATCT TCC T C ATG GGACCAGTGAAACAGC TGAAGCGAATGT TT GA 
GCCTACTCGTTTGATTGCAACTATCATGGTGCTGTTGTGTTTTGCACTTACCCTGTGTTCTGCCT 
TTTGGTGGCATAACAAGGGACTTGCACTTATCTTCTGCATTTTGCAGTCTTTGGCATTGACGTGG 
TACAGCCTTTCCTTCATACCATTTGCAAGGGATGCTGTGAAGAAGTGTTTTGCCGTGTGTCTTGC 
ATAA T T CAT GGCCAGT T T TATGAAGCTT TGGAAGGCAC TATGGACAGAAGC TGGTGGACAGTT T T 
GTAACTATCTTCGAAACCTCTGTCTTACAGACATGTGCCTTTTATCTTGCAGCAATGTGTTGCTT 
GTGAT TCGAACAT T T GAGGGT TACT T TT GGAAGCAACAATACATTCT CGAACC T GAAT GTCAGTA 
GCACAGGATGAGAAGTGGGTTCTGTATCTTGTGGAGTGGAATCTTCCTCATGTACCTGTTTCCTC 
T CT GGATGTT GT CCCAC TGAAT T CCCATGAATACAAACC TAT TCAGCAACAGCAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^ 
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FIGURE 56 

MGPVKQLKRMFEPTRL I AT IMVLLC FAL TLC SAFWWHNKGLAL I FC I LQS LALTWYSLSFIPFAR 
DAVKKC FAVCLA 

Important features: 
Signal peptide: 

amino acids 1-33 

Type II fibronectin collagen-binding domain protein. 

amino acids 30-72 
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FIGURE R7 

CGGCTCGAGCTCGAGCCGAATCGGCTCGAGGGGCAGTGGAGCACCCAGCAGGCCGCCAACATGCTCTGTCTGTGCCTG 
TACGT GC CGGTCAT CGGGGAAGCC CAGAC C GAGTTCCAGTACTTT GAGT CGAAGGGGCT C CCT GCCGAGCT GAAGTCC 
ATTT TCAAGCTCAGT GT CTT CAT C CCCT C CCAG GAATT CT CCAC C TACC GCCAGT GGAAGCAGAAAATT GTACAAGCT 
GGAGATAAGGACCTT GAT GGGCAGCT AGACTT T GAAGAATTTGTC CAT TATCT C CAAGAT CAT GAGAAGAAGCT GAGG 
CT GGT GTTTAAGATT TT G GACAAAAAGAAT GAT GGAC GCATT GAC GC GCAGGAGATCAT GCAGT CC CT GCGGGACTT G 
GGAGT CAAGATAT CT GAACAGCAGGCAGAAAAAATTCT CAAGAGCAT GGATAAAAACG GCAC GAT GACCATCGACTGG 
AACGAGT G GAGAGACT AC CACCT C CTCCAC CC CGT GGAAAACAT CCC CGAGAT CAT CCT CTACT GGAAGCAT T C CACG 
ATCTTTGATGTGGGTGAGAATCTAACGGTCCCGGATGAGTTCACAGTGGAGGAGAGGCAGACGGGGATGTGGTGGAGA 
CACCTGGTGGCAGGAGGTGGGGCAGGGGCCGTATCCAGAACCTGCACGGCCCCCCTGGACAGGCTCAAGGTGCTCATG 
CAGGTCCATGCCTCCCGCAGCAACAACATGGGCATCGTTGGTGGCTTCACTCAGATGATTCGAGAAGGAGGGGCCAGG 
TCACT CT GGC GGG GCAAT GG CAT CAACGT C CT CAAAATT GCCCCC GAAT CAGC CATCAAATT CAT GGC CTAT GAGCAG 
AT CAAGC GCCTT GT T GGT AGTGACCAGGAGACT CT GAGGATT CACGAGAGGCT TGT GGCAGGGTCCTT GGCAGGGGCC 
AT C GCCCAGAGCAGCAT CTACCCAAT GGAGGT CCT GAAGACCCGGAT GGCGCT GCGGAAGACAGGCCAGTACT CAGGA 
AT G CT GGACT GC GCCAGGAGGAT CCTGGCCAGAGAGGGGGTGGCC GC CT T CTACAAAGGCT AT GT C CC CAACAT GCT G 
GGCATCAT CCCCTAT GCCGGCATCGACCTTGCAGTCTACGAGACGCTCAAGAATGCCTGGCTGCAGCACTAT GCAGTG 
AACAGCGCGGACCCCGGCGTGTTTGTGCTCCTGGCCTGTGGCACCATGTCCAGTACCTGTGGCCAGCTGGCCAGCTAC 
CCCCTGGCCCTAGTCAGGACCCGGATGCAGGCGCAAGCCTCTATTGAGGGCGCTCCGGAGGTGACCATGAGCAGCCTC 
TTCAAACATATCCTGCGGACCGAGGGGGCCTTCGGGCTGTACAGGGGGCTGGCCCCCAACTTCATGAAGGTCATCCCA 
GCT GT GAGCATCAGCTAC GT G GT CTAC GAGAACCTGAAGATCACC CT GGGCGT GCAGT CGC GGTGACGGGGGGAGGGC 
CGCC CGGCAGTGGACT C GCT GAT CCT GGGC CGCAGCCT GGGGTGT GCAGCCAT CT CAT TCT GT GAAT GTGC CAACACT 
AAGCTGTCTCGAGCCAAGCTGTGAAAACCCTAGACGCACCCGCAGGGAGGGTGGGGAGAGCTGGCAGGCCCAGGGCTT 
GTCCTGCTGACCCCAGC^GACCCTCCTGTTGGTTCCAGCGAAGACCACAGGCATTCCTTAGGGTCCAGGGTCAGCAGG 
CTCCGGGCTCACATGTGTAAGGACAGGACATTTTCTGCAGTGCCTGCCAATAGTGAGCTTGGAGCCTGGAGGCCGGCT 
TAGTTCTTCCATTTCACCCTTGCAGCCAGCTGTTGGCCACGGCCCCTGCCCTCTGGTCTGCCGTGCATCTCCCTGTGC 
CCTCTTGCTGCCTGCCTGTCTGCTGAGGTAAGGTGGGAGGAGGGCTACAGCCCACATCCCACCCCCTCGTCCAATCCC 
ATAATCCATGATGAAAGGTGAGGTCACGTGGCCTCCCAGGCCTGACTTCCCAACCTACAGCATTGACGCCAACTTGGC 
TGTGAAGGAAGAGGAAAGGATCTGGCCTTGTGGTCACTGGCATCTGAGCCCTGCTGATGGCTGGGGCTCTCGGGCATG 
CTTGGGAGTGCAGGGGGCTCGGGCTGCCTGGCCTGGCTGCACAGAAGGCAAGTGCTGGGGCTCATGGTGCTCTGAGCT 
GGCCT GGACCCTGTCAGGATGGGCCCCACCTCAGAACCAAACTCACTGTCCCCACT GTGGCAT GAGGGCAGTGGAGCA 
C CATGT TT GAGGGCGAAGGGCAGAGC GTTT GT GTGTT CTGGGGAGGGAAGGAAAAGGT GTT GGAGG C CTTAATTAT GG 
ACT GT TGG GAAAAGGGTTTT GT C CAGAAGGACAAGCC GGACAAAT GAGCGACTTCT GT GCT T C CAGAGGAAGACGAGG 
GAGCAGGAGCTTGGCTGACTGCTCAGAGTCTGTTCTGACGCCCTGGGGGTTCCTGTCCAACCCCAGCAGGGGCGCAGC 
G GGACCAGCCC CACATT CCACTT GT GT CACT G CT T G GAAC CT ATTTATTTT GT ATTTATTT GAACAGAGTTAT GT CCT 
AACTATTTTTATAGATTTGTTTAATTAATAGCTTGTCATTTTCAAGTTCATTTTTTATTCATATTTATGTTCATGGTT 
GATTGTACCTTCCCAAGCCCGCCCAGTGGGATGGGAGGAGGAGGAGAAGGGGGGCCTTGGGCCGCTGCAGTCACATCT 
GTCCAGAGAAATTCCTTTTGGGACTGGAGGCAGAAAAGCGGCCAGAAGGCAGCAGCCCTGGCTCCTTTCCTTTGGCAG 
GTTGGGGAAGGGCTTGCCCCCAGCCTTAGGATTTCAGGGTTTGACTGGGGGCGTGGAGAGAGAGGGAGGAACCTCAAT 
AACCTTGAAGGTGGAATCCAGTTATTT CCT GCGCTGCGAGGGTTTCTTTATTT CACT CTTT TCT GAAT GTCAAGGCAG 
TGAGGTGCCTCTCACTGTGAATTTGTGGTGGGCGGGGGCTGGAGGAGAGGGTGGGGGGCTGGCTCCGTCCCTCCCAGC 
CTTCTGCTGCCCTTGCTTAACAATGCCGGCCAACTGGCGACCTCACGGTTGCACTTCCATTCCACCAGAATGACCTGA 
T GAGGAAATCTT CAATAGGAT GCAAAGATCAAT GCAAAAAT T GTTAT ATAT GAACATATAACT GGAGT CGT CAAAAAG 
CAAATTAAGAAAGAAT T G GACGTTAGAAGTT GT CATTTAAAGCAG CCT T CTAATAAAGTT GTT T CAAAGCT GAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



FIGURE 58 



MLCLCLYVPVIGEAQTEFQYFESKGLPAELKSIFKLSVFIPSQEFSTYRQWKQKIVQAGDKDLDG 
QLDFEE FVHYLQDHEKKLRLVFKI LDKKNDGRI DAQE IMQS LRDLGVKI S EQQAEKI LKSMDKNG 
TMT I DWNEWRDYHLLHPVEN I PE 1 1 LYWKHS T I FDVGENLT VPDE FT VEERQTGMWWRHLVAGGG 
AGAVSRTCTAPLDRLKVLMQVHASRSNNMGIVGGFTQMIREGGARSLWRGNGIWLKI 
FMAYEQIKRLVGSDQETLRIHERLVAGSIiAGAIAQSSIYPMEVLKTRMALRKTGQYSGMLDCARR 
I LAREGVAAFYKGYVPNMLGI I PYAGI DLAVYETLKNAWLQHYAVNS ADPGVFVLLACGTMSS T C 
GQLAS Y PLALVRTRMQAQAS I E GAPE VTMS S L FKH I LRTEGAFGLYRGLAPNFMKVI PAVS I S YV 
VYENLKI TLGVQSR 

Important features: 
Signal peptide: 

amino acids 1-16 

Putative transmembrane domains: 

amino acids 284-304, 339-360, 376-394 

Mitochondrial energy transfer proteins signature. 

amino acids 206-215, 300-309 

N-glycosylation sites. 

amino acids 129-133, 169-173 

Elongation Factor-hand calcium-binding protein. 

amino acids 54-73, 85-104, 121-140 
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FIGURE SQ 

GGAAGGCAGCGGCAGCT CCACT CAGCCAGTAC CCAGATACGC TGGGAACC TT CCCCAGCCATGGC 
TTCCCTGGGGCAGATCCTCTTCTGGAGCATAATTAGCATCATCATTATTCTGGCTGGAGCAATTG 
CACTCATCAT T GGC T TT GGTAT T T CAGGGAGACAC T CCATCACAGT CAC TACTGTCGCCTCAGC T 
GGGAACAT T GGGGAGGATGGAAT CC TGAGCTGCAC T TT TGAACCT GACAT CAAACT T T C TGATAT 
CG T GATACAAT GGCT GAAGGAAGGT GT T T TAGGC T T GGTC CAT GAGT TCAAAGAAGGCAAAGAT G 
AGCT GTC GGAGCAGGAT GAAAT GT T CAGAGGCCGGACAGCAGT GT TT GC TGAT CAAGTGATAGT T 
GGCAATGCCTCTTTGCGGCTGAAAAACGTGCAACTCACAGATGCTGGCACCTACAAATGTTATAT 
CAT CAC T TC T AAAG G C AAGGGGAAT G C TAAC C T T GAGTATAAAAC T G GAGC C T T C AGCAT GCC G G 
AAGTGAATGTGGACTATAATGCCAGCTCAGAGACCTTGCGGTGTGAGGCTCCCCGATGGTTCCCC 
CAGCCCACAGTGGT CTGGGCAT CCCAAGT TGACCAGGGAGCCAACT T CT CGGAAGT CT CCAATAC 
CAGC T T T GAGC T GAACT CTGAGAATG TGACCAT GAAGGT T GT GTC T GTGC TCTACAAT GT TACGA 
T CAACAACACATACT CC TGTAT GAT T GAAAAT GACAT TGCCAAAGCAACAGGGGAT AT CAAAGT G 
ACAGAATCGGAGATCAAAAGGCGGAGTCACCTACAGCTGCTAAACTCAAAGGCTTCTCTGTGTGT 
CTCTTCTTTCTTTGCCATCAGCTGGGCACTTCTGCCTCTCAGCCCTTACCTGATGCTAAAATAAT 
GTGCCTTGGCCACAAAAAAGCATGCAAAGTCATTGTTACAACAGGGATCTACAGAACTATTTCAC 
CACCAGATATGACC TAGT TT TAT AT T TCTGGGAGGAAAT GAAT TCATAT C TAGAAGT C TGGAGTG 
AGCAAACAAGAGCAAGAAACAAAAAGAAGCCAAAAGCAGAAGGCTCCAATATGAACAAGATAAAT 
CT AT CT T CAAAGACATAT TAGAAGT T GGGAAAATAAT TCATGT GAACTAGACAAGT GTGT TAAGA 
GTGATAAGTAAAATGCACGTGGAGACAAGTGCATCCCCAGATCTCAGGGACCTCCCCCTGCCTGT 
CACCTGGGGAGTGAGAGGACAGGATAGTGCATGTTCTTTGTCTCTGAATTTTTAGTTATATGTGC 
T GT AATGT TGC T CT GAGGAAGCCCCT GGAAAGT CTAT CC CAACATAT CCACATC T TAT AT T CCAC 
AAATTAAGCTGTAGTATGTACCCTAAGACGCTGCTAATTGACTGCCACTTCGCAACTCAGGGGCG 
GCTGCATTTTAGTAATGGGTCAAA.TGATTCACTTTTTATGATGCTTCCAAAGGTGCCTTGGCTTC 
T CTT CCCAACT GACAAATGCCAAAGT TGAGAAAAAT GAT CATAAT T T TAGCATAAACAGAGCAGT 
CGGGGACACCGATTTTATAAATAAACTGAGCACCTTCTTTTTAAACAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAA&AAAAAAAAA 
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FIGURE 60 

MASLGQILFWS I IS 1 1 1 ILAGAIALI IGFGI SGRHS I TVTTVASAGNIGEDGILSCT FEPDIKLS 
DIVIQWLKEGVLGLVHE FKEGKDELSEQDEMFRGRTAVFADQVIVGNASLRLKNVQLTDAGTYKC 
YIITSKGKGNANLEYKTGAFSMPEVNVDYNASSETLRCEAPRWFPQPTWWASQVDQGANFSEVS 
NTSFELNSENVTMKWSVLY1WTINNTYSCMIENDIAKATGDIKVTESEIKRRSHLQLLNSKASL 
CVS S FFAI S WALLPLS PYLMLK 

Important features: 
Signal peptide: 

amino acids 1-28 

Transmembrane domain: 

amino acids 258-281 

N-glycosylation sites. 

amino acids 112-116, 160-164, 190-194, 196-200, 205-209, 216-220, 
220-224 

N-myristoylation sites. 

amino acids 52-58, 126-132, 188-194 
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FIGURE 61 



TGACGTCAGAATCACCATGGCCAGCTATCCTTACCGGCAGGGCTGCCCAGGAGCTGCAGGACAAG 
CACCAGGAGCCCCTCCGGGTAGCTACTACCCTGGACCCCCCAATAGTGGAGGGCAGTATGGTAGT 
GGGCTACCCCCTGGTGGTGGTTATGGGGGTCCTGCCCCTGGAGGGCCTTATGGACCACCAGCTGG 
TGGAGGGCCCTATGGACACCCCAATCCTGGGATGTTCCCCTCTGGAACTCCAGGAGGACCATATG 
GCGGTGCAGCTCCCGGGGGCCCCTATGGTCAGCCACCTCCAAGTTCCTACGGTGCCCAGCAGCCT 
GGGCTTTATGGACAGGGTGGCGCCCCTCCCAATGTGGATCCTGAGGCCTACTCCTGGTTCCAGTC 
GGTGGACTCAGATCACAGTGGCTATATCTCCATGAAGGAGCTAAAGCAGGCCCTGGTCAACTGCA 
ATTGGTCTTCATTCAATGATGAGACCTGCCTCATGATGATAAACATGTTTGACAAGACCAAGTCA 
GGCCGCATCGATGTCTACGGCTTCTCAGCCCTGTGGAAATTCATCCAGCAGTGGAAGAACCTCTT 
CCAGCAGTATGACCGGGACCGCTCGGGCTCCATTAGCTACACAGAGCTGCAGCAAGCTCTGTCCC 
AAATGGGCTACAACCTGAGCCCCCAGTTCACCCAGCTTCTGGTCTCCCGCTACTGCCCACGCTCT 
GCCAATCCTGCCATGCAGCTTGACCGCTTCATCCAGGTGTGCACCCAGCTGCAGGTGCTGACAGA 

GGCC T T CCGGGAGAAGGACACAGCT GTACAAGGCAACATCCGGCTCAGC T T CGAGGAC T TCGTCA 
CCATGACAGCTTCTCGGATGCTATGACCCAACCATCTGTGGAGAGTGGAGTGCACCAGGGACCTT 
TCCTGGCTTCTTAGAGTGAGAGAAGTATGTGGACATCTCTTCTTTTCCTGTCCCTCTAGAAGAAC 
ATTCTCCCTTGCTTGATGCAACACTGTTCCAAAAGAGGGTGGAGAGTCCTGCATCATAGCCACCA 
AATAGTGAGGACCGGGGCTGAGGCCACACAGATAGGGGCCTGATGGAGGAGAGGATAGAAGTTGA 
ATGTCCTGATGGCCATGAGCAGTTGAGTGGCACAGCCTGGCACCAGGAGCAGGTCCTTGTAATGG 
AGTTAGTGTCCAGTCAGCTGAGCTCCACCCTGATGCCAGTGGTGAGTGTTCATCGGCCTGTTACC 
GTTAGTACCTGTGTTCCCTCACCAGGCCATCCTGTCAAACGAGCCCATTTTCTCCAAAGTGGAAT 

CTGACCAAGCAT GAGAGAGATC T GTC TATGGGACCAGTGGCTTGGAT T CT GCCACACCCATAAAT 
CCTTGTGTGTTAACTTCTAGCTGCCTGGGGCTGGCCCTGCTCAGACAAATCTGCTCCCTGGGCAT 
CTTTGGCCAGGCTTCTGCCCCCTGCAGCTGGGACCCCTCACTTGCCTGCCATGCTCTGCTCGGCT 
TCAGTCTCCAGGAGACAGTGGTCACCTCTCCCTGCCAATACTTTTTTTAATTTGCATTTTTTTTC 
AT TT GGGGCCAAAAG TCCAGTGAAAT T GTAAGC T TCAATAAAAGGAT GAAAC TCT GA 



FIGURE 62 



MASYPYRQGCPGAAGQAPGAPPGSYYPGPPNSGGQYGSGLPPGGGYGGPAPGGPYGPPAGGGPYG 
HPNPGMFPSGTPGGPYGGAAPGGPYGQPPPSSYGAQQPGLYGQGGAPPNVDPEAYSWFQSVDSDH 
SGY I SMKELKQALVNCNWS S FNDE TCLMMINMFDKTKS GRI DVYGFSALWKFIQQWKNL FQQYDR 
DRSGSISYTELQQALSQMGYNLSPQFTQLLVSRYCPRSANPAMQLDRFIQVCTQLQVLTEAFREK 
DTAVQGN I RL S FE D FVTMTASRML 

Important features of the protein: 
Signal peptide: 

amino acids 1-19 

N-glycosylation site. 

amino acids 147-150 

Casein kinase II phosphorylation sites. 

amino acids 135-138, 150-153, 202-205, 271-274 

N-myristoylation sites. 

amino acids 9-14, 15-20, 19-24, 33-38, 34-39, 39-44, 43-48, 61- 
66, 70-75, 78-83, 83-88, 87-92, 110-115 
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FIGURE 6s 

CAGGATGCAGGGCCGCGTGGCAGGGAGCTGCGCTCCTCTGGGCCTGCTCCTGGTCTGTCTTCATC 

TCCCAGGCCTCTTTGCCCGGAGCATCGGTGTTGTGGAGGAGAAAGTTTCCCAAAACTTCGGGACC 

AACTTGCCTCAGCTCGGACAACCTTCCTCCACTGGCCCCTCTAACTCTGAACATCCGCAGCCCGC 

TCTGGACCCTAGGTCTAATGACTTGGCAAGGGTTCCTCTGAAGCTCAGCGTGCCTCCATCAGATG 

GCTTCCCACCTGCAGGAGGTTCTGCAGTGCAGAGGTGGCCTCCATCGTGGGGGCTGCCTGCCATG 

GATTCCTGGCCCCCTGAGGATCCTTGGCAGATGATGGCTGCTGCGGCTGAGGACCGCCTGGGGGA 

AGCGCTGCCTGAAGAACTCTCTTACCTCTCCAGTGCTGCGGCCCTCGCTCCGGGCAGTGGCCCTT 

TGCCTGGGGAGTCTTCTCCCGATGCCACAGGCCTCTCACCTGAGGCTTCACTCCTCCACCAGGAC 

TCGGAGTCCAGACGACTGCCCCGTTCTAATTCACTGGGAGCCGGGGGAAAAATCCTTTCCCAACG 

CCCTCCCTGGTCTCTCATCCACAGGGTTCTGCCTGATCACCCCTGGGGTACCCTGAATCCCAGTG 

TGTCCTGGGGAGGTGGAGGCCCTGGGACTGGTTGGGGAACGAGGCCCATGCCACACCCTGAGGGA 

ATCTGGGGTATCAATAATCAACCCCCAGGTACCAGCTGGGGAAATATTAATCGGTATCCAGGAGG 

CAGCTGGGGAAATATTAATCGGTATCCAGGAGGCAGCTGGGGGAATATTAATCGGTATCCAGGAG 

GCAGCTGGGGGAATATTCATCTATACCCAGGTATCAATAACCCATTTCCTCCTGGAGTTCTCCGC 

CCTCCTGGCTCTTCTTGGAACATCCCAGCTGGCTTCCCTAATCCTCCAAGCCCTAGGTTGCAGTG 

GGGCTAGAGCACGATAGAGGGAAACCCAACATTGGGAGTTAGAGTCCTGCTCCCGCCCCTTGCTG 

TGTGGGCTCAATCCAGGCCCTGTTAACATGTTTCCAGCACTATCCCCACTTTTCAGTGCCTCCCC 

TGC T CAT CTCCAATAAAATAAAAGCACT TATGAAAAAAAAAAA2^^ 

AA?VAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^^ 
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FIGURE 6 A 



MQGRVAGSCAPLGLLLVCLHLPGLFARSIGVVEEKVSQNFGTNLPQLGQPSSTGPSNSEHPQPAL 
DPRSNDl^VPLKLSVPPSDGFPPAGGSAVQRWPPSWGLPA^ 

LPEELSYLSSAAALAPGSGPLPGESSPDATGLSPEASLLHQDSESRRLPRSNSLGAGGKILSQRP 
PWSLIHRVLPDHPWGTLNPSVSWGGGGPGTGWGTRPMPHPEGIWGINNQPPGTSWGNINRYPGGS 
WGNINRYPGGSWGNINRYPGGSWGNIHLYPGINNPFPPGVLRPPGSSWNIPAGFPNPPSPRLQWG 

Important features of the protein: 
Signal peptide: 

amino acids 1-26 

Casein kinase II phosphorylation sites. 

amino acids 56-59, 155-158 

N-myristoylation sites. 

amino acids 48-53, 220-225, 221-226, 224-229, 247-252, 258-263, 
259-264, 269-274, 270-275, 280-285, 281-286, 305-310 
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FIGURE 65 



AAGGAGAGGCCACCGGGAC T T CAGTGT CTCC T CCATCCCAGGAGCGCAGTGGCCACTATGGGGT C 
TGGGCTGCCCCTTGTCCTCCTCTTGACCCTCCTTGGCAGCTCACATGGAACAGGGCCGGGTATGA 
CTTTGCAACTGAAGCTGAAGGAGTCTTTTCTGACAAATTCCTCCTATGAGTCCAGCTTCCTGGAA 
TTGCTTGAAAAGCTCTGCCTCCTCCTCCATCTCCCTTCAGGGACCAGCGTCACCCTCCACCATGC 
AAGATCTCAACACCATGTTGTCTGCAACACATGACAGCCATTGAAGCCTGTGTCCTTCTTGGCCC 
GGGCTTTTGGGCCGGGGATGCAGGAGGCAGGCCCCGACCCTGTCTTTCAGCAGGCCCCCACCCTC 
CTGAGTGGCAATAAATAAAATTCGGTATGCTG 
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FIGURE 66 

MGSGLPLVLLLTLLGSSHGTGPGMTLQLKLKESFLTNSSYESSFLELLEKLCLLLHLPSGTSVTL 
HHARSQHHWCNT 



Important features: 
Signal peptide: 

amino acids 1-19 

N-glycosylation site. 

amino acids 37-41 

N-myristoylation sites. 

amino acids 15-21, 19-25, 60-66 
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FIGURE 67 



ACGGACCGAGGGTTCGAGGGAGGGACACGGACCAGGAACCTGAGCTAGGTCAAAGACGCCCGGGC 
CAGGTGCCCCGTCGCAGGTGCCCCTGGCCGGAGATGCGGTAGGAGGGGCGAGCGCGAGAAGCCCC 
TTCCTCGGCGCTGCCAACCCGCCACCCAGCCCATGGCGAACCCCGGGCTGGGGCTGCTTCTGGCG 
CTGGGCCTGCCGTTCCTGCTGGCCCGCTGGGGCCGAGCCTGGGGGCAAATACAGACCACTTCTGC 
AAATGAGAATAGCACTGTTTTGCCTTCATCCACCAGCTCCAGCTCCGATGGCAACCTGCGTCCGG 
AAGCCATCACTGCTATCATCGTGGTCTTCTCCCTCTTGGCTGCCTTGCTCCTGGCTGTGGGGCTG 
GCACTGTTGGTGCGGAAGCTTCGGGAGAAGCGGCAGACGGAGGGCACCTACCGGCCCAGTAGCGA 
GGAGCAGTTCTCCCATGCAGCCGAGGCCCGGGCCCCTCAGGACTCCAAGGAGACGGTGCAGGGCT 
GCCTGCCCATCTAGGTCCCCTCTCCTGCATCTGTCTCCCTTCATTGCTGTGTGACCTTGGGGAAA 
GGCAGTGCCCTCTCTGGGCAGTCAGATCCACCCAGTGCTTAATAGCAGGGAAGAAGGTACTTCAA 
AGACTCTGCCCCTGAGGTCAAGAGAGGATGGGGCTATTCACTTTTATATATTTATATAAAATTAG 
TAGTGAGATGTAAAAAAAAAAAAAAAAAA 
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FIGURE 68 



MANPGLGLLLALGLPFLLAR^^ 

LIJU^LLAVGIALLVRKLREKRQTEGTYRPSSEEQFSHAAEARAPQDSKETVQGCLPI 

Important features: 
Signal peptide: 

amino acids 1-19 

Transmembrane domain: 



amino acids 56-80 



N-glycosylation site. 



amino acids 3 6-4 0 




cAMP- and cGMP-dependent protein kinase phosphorylation site. 



amino acids 86-90 



Tyrosine kinase phosphorylation site. 

amino acids 8 6-94 



N-myristoylation sites. 

amino acids 7-13, 26-32 
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FIGURE 6q 



GCCAGGAATAACT AGAGAGGAACAATGGGGTTAT T CAGAG GTTTT GTTTT CCTCTTAGTTCTGT GCCTGCT GCAC CAG 
TCAAATACTTCCTTCATTAAGCTGAATAATAATGGCTTTGAAGATATTGTCATTGTTATAGATCCTAGTGTGCCAGAA 
GAT GAAAAAATAATT GAACAAATAGAGGATAT GGT GACTACAGCTT CTAC GTAC CT GTTT GAAGCCACAGAAAAAAGA 
T TT TTTTT CAAAAAT GT AT CTATATTAATT CCT GAGAATT GGAAGGAAAAT C CT CAGTAQAAAAGG C CAAAACAT GAA 
AACCATAAACAT GCT GATGTTATAGT T GCACCACCTACACT CCCAGGTAGAGAT GAACCAT AC ACCAAGCAGTT CACA 
GAAT GTGGAGAGAAAGG CGAATACAT T CACT T CAC CCCT GACCTT CT ACT T GGAAAAAAACAAAAT GAATAT GGACCA 
CCAGGCAAACTGTTTGTCCATGAGTGGGCTCACCTCCGGTGGGGAGTGTTTGATGAGTACAATGAAGATCAGCCTTTC 
T ACCGT GCTAAGTCAAAAAAAATCGAAGCAACAAGGT GTT CCGCAGGTAT CT CT GGTAGAAATAGAGTTTATAAGT GT 
CAAGGAGGCAGCTGTCTTAGTAGAGCATGCAGAATTGATTCTACAACAAAACTGTATGGAAAAGATTGTCAATTCTTT 
CCTGATAAAGTACAAACAGAAAAAGCATCCATAAT GTTTAT G CAAAGTATTGAT TCT GTTGT T GAATTTTGTAAC GAA 
AAAACCCATAAT CAAGAAG CT CCAAGCCTACAAAACATAAAGTGCAATTT TAGAAGTACAT GGGAGGTGATTAGCAAT 
T CTGAGGAT TTTAAAAACACCATACCCAT GGT GACAC CACCT CCT CCACCT GTCTT CTCATT GCT GAAGAT CAGT CAA 
AGAAT TGT GT G CTTAGTTCT T GATAAGTCT GGAAGCATGGGGGGT AAGGACC GC CTAAAT C GAATGAAT CAAGCAGCA 
AAACATTTC CT GCT G CAGACT GTT GAAAAT GGAT CCT GGGT GGGGAT GGT T CACT T TGATAGTACT G CCACTATT GTA 
AATAAGCTAAT CCAAATAAAAAGCAGT GAT GAAAGAAACACACTCAT GGCAGGATTACCTACATATC CT CT GGGAGGA 
ACTTCCATCTGCTCTGGAATTAAATATGCATTTCAGGTGATTGGAGAGCTACATTCCCAACTCGATGGATCCGAAGTA 
CTGCTGCTGACTGATGGGGAGGATAACACTGCAAGTTCTTGTATTGATGAAGTGAAACAAAGTGGGGCCATTGTTCAT 
T TTATTGCTTT GGGAAGAGCT GCT GAT GAAGCAGTAATAGAGATGAGCAAGATAACAGGAGGAAGT CATTT TTAT GTT 
TCAGATGAAGCTCAGAACAATGGCCTCATTGATGCTTTTGGGGCTCTTACATCAGGAAATACTGATCTCTCCCAGAAG 
T C CCTTCAGC T CGAAAGTAAGGGATT AACACT GAATAGT AAT GC CTGGAT GAAC GACACT GT CATAATTGAT AGTACA 
GT GGGAAAGGACACGTT C TTT CT CAT CACAT GGAACAGT CT GCCT CC CAGTATT TCT CT CT GGGAT CCCAGT GGAACA 
ATAAT GGAAAATTT CACAGT GGAT GCAACTT C CAAAAT G GCCTAT CT CAGTATT CCAGGAACT GCAAAGGTGGGCAC T 
T GGGCATACAATCTT CAAGC CAAAGCGAAC CCAGAAACAT TAACTATTACAGTAACTTCT CGAGCAGCAAAT TCTT CT 
GT GCCT C CAATCACAGT GAAT GCTAAAATGAATAAGGAC GTAAACAGTT T CC CCAGCCCAATGAT T GTT TAC GCAGAA 
ATTCTACAAGGATATGTACCTGTTCTTGGAGCCAATGTGACTGCTTTCATTGAATCACAGAATGGACATACAGAAGTT 
T T GGAACT TTT GGAT AAT G GT GCAGGCGCT GATT CTTT CAAGAAT GAT GGAGT CTACTC CAGGTATTT TACAGCATAT 
ACAGAAAAT GG CAGATATAGCT TAAAAGTTCGGG CTCAT GGAGGAGCAAACACT GCCAGG CT AAAATTACGGC CT CCA 
CT GAATAGAGCCGC GTACATACCAGGCTGG GT AGT GAACGGGGAAATT GAAGCAAACCCGCCAAGACCT GAAATT GAT 
GAGGATAC T CAGACCACCTT GGAGGATTT CAGCCGAACAGCAT C C GGAGGTGCATT T GT GGTAT CACAAGTCC CAAGC 
CT TC CCTTGCCT GAC CAATACCCAC CAAGT CAAAT CACAGACCTT GATGCCACAGTT CAT GAGGATAAGAT TAT T CT T 
ACATGGAC^GC^CC^GGAGATAATTTTGATGTTGGAAAAGTTCAACGTTATATCATAAGAATAAGTGCAAGTATTCTT 
GATCTAAGAGACAGT TTT GAT GAT GCT CTT CAAGTAAATACTACT GAT CT GT CAC CAAAGGAGGCCAACT CCAAGGAA 
AGCTTTGCATTTAAACCAGAAAATATCTCAGAAGAAAATGCAACCCACATATTTATTGCCATTAAAAGTATAGATAAA 
AGCAATT T GACATCAAAAGTAT C CAACAT T GCACAAGTAACTTTGTTT AT CC CT CAAGCAAAT C CTGAT GACAT T GAT 
CCTACACCTACT CCTACT C CTACT C CT ACT CCT GATAAAAGT CATAATT CTGGAGTTAATAT T T CTACGCT G GT ATT G 
TCTGTGATTGGGTCTGTTGTAATTGTTAACTTTATTTTAAGTACCACCATT TGAA CCTTAACGAAGAAAAAAATCTTC 
AAGTAGACCTAGAAGAGAGTTT TAAAAAACAAAACAAT GTAAGTAAAG GATATTT CT GAAT CT T AAAAT TCAT CCCAT 
GTGT GAT CATAAACT CAT AAAAATAATTT TAAGAT GT CGGAAAAGGATACTTT GATTAAATAAAAACACT CAT GGAT A 
TGTAAAAACT GTCAAGATTAAAATT TAATAGTTT CATTTATTT GT TATT T TATTT GTAAGAAATAGT GATGAACAAAG 
ATCCTTTTTCATACTGATACCTGGTTGTATATTATTTGATGCAACAGTTTTCTGAAATGATATTTCAAATTGCATCAA 
GAAATT AAAAT CAT CTAT CT GAGTAGT CAAAATACAAGTAAAGGAGAG CAAATAAACAACATTT GGAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 7Q 

MGLFRGFVFLLVLCLLHQSNTSFIKLNNNGFEDIVIVIDPSVPEDEKIIEQIEDMVTTASTYLFE 
ATEKRFFFKOTSILIPENWKENPQYKRPKHENHKHADVIVAPPTLPGRDEPYTKQFTECGEKGEY 
IHFTPDLLLGKKQNEYGPPGKLFVHEWAHLRWGVFDEYNEDQPFYRAKSKKIEATRCSAGISGRN 
RVYKCQGGSCLSRACRIDS TTKLYGKDCQFFPDKVQTEKAS IMFMQS I DS WE FCNEKTHNQEAP 
SLQNIKCNFRSTWEVI SNSEDFKNT I PMVTPPPPPVFSLLKI SQRIVCLVLDKSGSMGGKDRLNR 
MNQAAKHFLLQTVENGSWGMVHFDSTATIWKLIQIKSSDERNTLMAGLPTYPLGGTSICSGIK 
YAFQVIGELHSQLDGSEVLLLTDGEDNTASSCIDEVKQSGAIVHFIALGRAADEAVIEMSKITGG 
SHFYVSDEAQNNGLIDAFGALTSGNTDLSQKSLQLESKGLTLNSNAWMNDTVIIDSTVGKDTFFL 
I TWNSLPPS I SLWDPS GT IMENFTVDATSKMAYLS I PGTAKVGTWAYNLQAKANPETLT I TVTSR 
AANSSVPPITWAKMNKDWSFPSPMIVYAEILQGWPVLGANVTAFIESQNGHTEVLELLDNGA 
GADSFKNDGVYSRYFTAYTENGRYSLKVRAHGGANTARLKLRPPLNRAAYIPGWVVNGEIEANPP 
RPEIDEDTQTTLEDFSRTASGGAFWSQVPSLPLPDQYPPSQITDLDATVHEDKIILTWTAPGDN 
FDVGKVQRY 1 1 RI S AS ILDLRDS FDDALQVNT TDLS PKEANSKE S FAFKPEN I SEENATH I FIAI 
KSIDKSNLTSKVSNIAQVTLFIPQANPDDIDPTPTPTPTPTPDKSHNSGVNISTLVLSVIGSWI 

VNFILSTTI 



W Signal peptide: 

*t\ amino acids 1-21 



Putative transmembrane domains: 

amino acids 284-300, 617-633 



Leucine zipper pattern. 

amino acids 469-491, 476-498 



N-glycosylation site. 

amino acids 20-24, 75-79, 340-344, 504-508, 542-546, 588-592, 
628-632, 811-815, 832-836, 837-841, 852-856, 896-900 
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FIGURE 71 



CT CCTT AGGT GGAAACCCT GGGAGTAGAGTACT GAC1AGCAAAGACCGGGAAAGACCATACGTC CCCG GGCAGGGGTGA 
CAACAGGTGT CATCT TTTTGATC TCGT GTGT GGCT GCCTTCCTATT T CAAGGAAAGACGCCAAGGTAAT TT T GACC CA 
GAG GAGCAAT GATGTAGCC AC CT CCTAACCT T CCCT TCTT GAACCCC CAGTTAT GCCAGGATTTACTAGAGAGTGT CA 
ACTCAACCAGCAAGCGGCTCCTTCGGCTTAACTTGTGGTTGGAGGAGAGAACCTTTGTGGGGCTGCGTTCTCTTAGCA 
GTGCTCAGAAGTGACTTGCCTGAGGGTGGACCAGAAGAAAGGAAAGGTCCCCTCTTGCTGTTGGCTGCACATCAGGAA 
GGCTGTGATGGGAATGAAGGTGAAAACTTGGAGATTTCACTTCAGTCATTGCTTCTGCCTGCAAGATCATCCTTTAAA 
AGTAGAGAAGCTGCT CTGT GT GGT GGT TAACT C CAAGAGGCAGAACT CGT T CTAGAAGGAAAT GGAT GCAAGCAGCTC 
CGGGGGCCCCAAACGCATGCTTCCTGTGGTCTAGCCCAGGGAAGCCCTTCCGTGGGGGCCCCGGCTTTGAGGGATGCC 
ACCGGTTCTGGACGCATGGCTGATTCCTGAATGATGATGGTTCGCCGGGGGCTGCTTGCGTGGATTTCCCGGGTGGTG 
GTTTTGCTGGTGCTCCTCTGCTGTGCTATCTCTGTCCTGTACATGTTGGCCTGCACCCCAAAAGGTGACGAGGAGCAG 
CT GGCACTGCCCAGGGCCAACAG CCCCACGG GGAAGGAGGGGTACCAGGCC GT CCTT CAGGAGT G GGAGGAG CAGCAC 
CGCAACTACGTGAGCAGCCTGAAGCGGCAGATCGCAC^GCTC^GGAGGAGCTGCTVGGAGAGGAGTGAGC^GCTCAGG 
Wi AAT GGGCAGTACCAAGCCAGCGAT GCT GCT GGCCT GGGT CTGGACAGGAGCCCC CCAGAGAAAACCCAGGCCGAC CT C 

f*| CTGGCCTTCCTGCACTCGCAGGTGGACAAGGCAGAGGTGAATGCTGGCGTCAAGCTGGCCACAGAGTATGCAGCAGTG 
CCTTTCGATAGCTTTACTCTACAGAAGGTGTACCAGCTGGAGACTGGCCTTACCCGCCACCCCGAGGAGAAGCCTGTG 
AGGAAGGACAAGCGGGATGAGTTGGTGGAAGCCATTGAATCAGCCTTGGAGACCCTGAACAATCCTGCAGAGAACAGC 
CCCAATCACCGTCCTTACACGGCCTCTGATTTCATAGAAGGGATCTACCGAA 
GAGCTCACCTTGAAAGGGGACCACAAACACGAATTCAAACGGCTC^^ 

GT GAAAAATGAAAAGCTCAACAT GGC CAACACGCTTAT CAAT GTTAT CGT GCCT CTAGC AAAAAGGGT GGACAAGT T C 
CGGCAGTTCATGCAGAATTTCAGGGAGATGTGCATTGAGCAGGATGGGAGAGTCCATCTCACTGTTGTTTACTTTGGG 
AAAGAAGAAATAAATGAAGTCAAAGGAATACTTGAAAACACTTCCAAAGCTGCCAACTTCAGGAACTTTACCTTCATC 
CAGCTGAATGGAGAATTTTCTCGGGGAAAGGGACTTGATGTTGGAGCCCGCTTCTGGAAGGGAAGCAACGTCCTTCTC 
TTTTTCTGTGATGTGGACATCTACTTCACATCTGAATTCCTCAATACGTGTAGGCTGAATACACAGCCAGGGAAGAAG 
GTATTTTAT CCAGTTCTTTTCAGT CAGT ACAAT CCT GGCAT AATATACGGCCAC CAT GAT GCAGT C CCT CC CTTGGAA 
CAGCAGCTGGT CATAAAGAAGGAAACT GGAT T TT GGAGAGACTTTGGATT T GGGAT GAC GT GT CAGTAT CGGTCAGAC 
TTCATCAATATAGGTGGGTTTGATCTGGACATCAAAGGCTGGGGCGGAGAGGATGTGCACCTTTATCGCAAGTATCTC 
CACAGCAACCT CATAGTGGT ACGGACGCCT GT GCGAGGACTCTT CCAC CT CT GGCATGAGAAGC GCT GCAT GGACGAG 
CT GACC CCC GAGCAGT ACAAGAT GTGCATGCAGT CCAAGGCCAT GAACGAGGCAT C CCACGGC C AGCT GGGCATGCT G 
tjj GTGTTCAGGCACGAGATAGAGGCTCACCTTCGCAAACAGAAACAGAAGACAAGTAGCAAAAAAACATQftACTCCCAGA 



111 



¥■1 



a 

m 



GAAGGATTGTGGGAGACACTTTTTCTTTCCTTTTGCAATTACTGAAAGTGGCTGCAACAGAGAAAAGACTTCCATAAA 
G GAC GACAAAAGAATT GGACTGAT GGGT CAGAGAT GAGAAAGCCTCC GAT T T CT CT CT GT T GGGCTTT TTACAACAGA 
AAT CAAAAT CT CCG CTTT GCCT GCAAAAGTAAC CCAGT T GCACCCT GT GAAGTGT CTGACAAAGGCAGAAT GCTTGT G 
AGATTATAAGCCTAATGGTGTGGAGGTTTTGATGGTGTTTACAATACACTGAGACCTGTTGTTTTGTGTGCTCATTGA 
AAT ATT CAT GATTTAAGAGCAGTTTT GT AAAAAATT CATTAGCATGAAAGGCAAGCATATTTCT CCT CATAT GAAT GA 
GC CT AT CAGCAGGGCTCTAGTT T CTAGGAAT G CTAAAATATCAGAAGGCAGGAGAGGAGATAGG CTTATTATGATACT 
AGT GAGTACATTAAGTAAAATAAAAT GGAC CAGAAAAGAAAAGAAAC CAT AAATAT CGT GT CATATTTT CC CCAAGAT 
TAACCAAAAATAAT CTGCTTAT CT TTTT GGTT GT C CTTT TAACT GT CT CC GT TTTT T T CT TTTATTTAAAAATGCACT 
TTTTTTCCCTTGTGAGTTATAGTCTGCTTATTTAATTACCACTTTGCAAGCCTTACAAGAGAGCACAAGTTGGCCTAC 
AT T TTTATATTT TTTAAGAAGATACTT T GAGATGCATTAT GAGAACTTTCAGTTCAAAGCATCAAATTGATGCCATAT 
CCAAGGACAT GCCAAATGCTGAT TCT GTCAGGCACT GAATGT CAGGC ATT GAGACATAGGGAAGGAAT GGTT T GTACT 
AATACAGACGTACAGATACTTT CT CT GAAGAGTATTTT CGAAGAGGAGCAACTGAACACTGGAGGAAAAGAAAAT GAC 
ACT TT CT GCT TT ACAGAAAAGGAAACT CATT CAGACTGGT GATATCGT GAT GTACCT AAAAGT CAGAAACC AC ATTT T 
CT CCT CAGAAGTAGGGAC CGCTTT CTTACCT GTT TAAATAAACC AAAGTATACCGTGTGAACCAAACAATCTCTTT TC 
AAAACAGGGT GCTCCTCCT GGCTT CT G GCTT CCATAAGAAGAAATG GAGAAAAATAT ATATATAT AT ATATAT ATT GT 
GAAAGAT CAAT CCAT CTGC CAGAAT CTAGT GGGAT GGAAGTTTTTGCTACATGTTATCCACCCCAGGCCAGGTGGAAG 
TAACTGAATTATTTTTTAAATTAAGCAGTTCTACTCAATCACCAAGATGCTTCTGAAAATTGCATTTTATTACCATTT 
CAAACTAT TTT TTAAAAAT AAATACAGTT AACATAGAGT GGTTTCTTCATTCATGTGAAAATTAT TAGC CAGCACCAG 
ATGCATGAGCTAATTATCTCTTTGAGTCCTTGCTTCTGTTTGCTCACAGTAAACTCATTGTTTAAAAGCTTCAAGAAC 
ATT CAAGCTGTT GGT GTGTT AAAAAAT G CAT T GTATTGATTTGTACTGGTAGTTTATGAAATTTAATTAAAACACAGG 
CCATGAATGGAAGGT GGT ATTGCACAGCTAATAAAAT AT GATT T GT GGAT AT GAA 
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FIGURE 72 



MMMVRRGLLAWI SRVWLLVLLCCAI SVLYMLACT PKGDEEQIALPRRNSPTGKEGYQAVLQEWE 
EQHRNYVSSLKRQIAQLKEELQERSEQLRNGQYQASDAAGLGLDRSPPEKTQADLLAFLHSQVDK 
AEVNAGVKLATEYAAVPFDS FTLQKVYQLETGLTRHPEEKPVRKDKRDELVEAIESALE TLNNPA 
ENSPNHRPYTASDFIEGIYRTERDKGTLYELTFKGDHKHEFKRLILFRPFSPIMKVKNEKLN]^ 
TLINVIVPIAKRVDKFRQFMQNFREMCIEQ 

TFIQLNGEFSRGKGLDVGARFWKGSNVLLFFCDVDIYFTSEFLNTCRLNTQPGKKVFYPVLFSQY 
NPGIIYGHHDAVPPLEQQLVIKKETGFWRDFGFGMTCQYRSDFINIGGFDLDIKGWGGEDVHLYR 
KYLHSNL I WRTPVRGLFHLWHEKRCMDELT PEQYKMCMQSKAMNEASHGQLGMLVFRHE I EAHL 
RKQKQKTSSKKT 

Important features: 
Signal peptide: 

amino acids 1-27 

N-glycosylation sites. 

amino acids 315-319, 324-328 

N-myristoylation sites. 

amino acids 96-10.2, 136-142, 212-218, 311-317, 339-345, 393-399 

Amidation site. 

amino acids 377-381 
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FIGURE 73 



GAGACTGCAGAGGGAGATAAAGAGAGAGGGCAAAGAGGCAGCAAGAGATTTGTCCTGGGGATCCA 
G AAAC C CAT GAT AC C C T AC T G AAC AC C G AAT C C C C T G G AAG C C C AC AGAG AC AG AG AC AG C AAG A 
GAAGCAGAGATAAATACACTCACGCCAGGAGCTCGCTCGCTCTCTCTCTCTCTCTCTCACTCCTC 
CCTCCCTCTCTCTCTGCCTGTCCTAGTCCTCTAGTCCTCAAATTCCCAGTCCCCTGCACCCCTTC 
CTGGGACACTATGTTGTTCTCCGCCCTCCTGCTGGAGGTGATTTGGATCCTGGCTGCAGATGGGG 
GTCAACACTGGACGTATGAGGGCCCACATGGTCAGGACCATTGGCCAGCCTCTTACCCTGAGTGT 
£jj GGAAACAATGCCCAGTCGCCCAT CGATATTCAGACAGACAGT GT GACAT TTGACCCT GATT TGCC 

J*l TGCTCTGCAGCCCCACGGATATGACCAGCCTGGCACCGAGCCTTTGGACCTGCACAACAATGGCC 
B! ACACAGTGCAACTCTCTCTGCCCTCTACCCTGTATCTGGGTGGACTTCCCCGAAAATATGTAGCT 
III GCCCAGC TCCACCTGCAC TGGGGT CAGAAAGGAT CCCCAGGGGGGT CAGAACACCAGAT CAACAG 

ill 



T GAAGCCACATT TGCAGAGCT CCACAT T GTACATTATGACTCTGATT CCTATGACAGC T TGAGT G 
AGGCTGCTGAGAGGCCTCAGGGCCTGGCTGTCCTGGGCATCCTAATTGAGGTGGGTGAGACTAAG 
AATATAGCTTATGAACACATTCTGAGTCACTTGCATGAAGTCAGGCATAAAGATCAGAAGACCTC 
AGTGCCTCCCTTCAACCTAAGAGAGCTGCTCCCCA7VACAGCTGGGGCAGTACTTCCGCTACAATG 



GCTCGCTCACAACTCCCCCTTGCTACCAGAGTGTGCTCTGGACAGTTTTTTATAGAAGGTCCCAG 
AT TT CAATGGAACAGCTGGAAAAGCTTCAGGGGACATTGT TCTCCACAGAAGAGGAGCCCTCTAA 
]K GCTTCTGGTACAGAACTACCGAGCCCTTCAGCCTCTCAATCAGCGCATGGTCTTTGCTTCTTTCA 
ffl TCCAAGCAGGATCCTCGTATACCACAGGTGAAATGCTGAGTCTAGGTGTAGGAATCTTGGTTGGC 
TGTCTCTGCCTTCTCCTGGCTGTTTATTTCATTGCTAGAAAGATTCGGAAGAAGAGGCTGGAAAA 
CCGAAAGAGTGTGGTCTTCACCTCAGCACAAGCCACGACTGAGGC ATAAA TTCCTTCTCAGATAC 
CAT GGAT G T GGAT GAC T T CCC T T CAT GC C TAT C AG GAAGC C T C TAAAAT GGGGT G T AGGAT C T GG 
CCAGAAACACTGTAGGAGTAGTAAGCAGATGTCCTCCTTCCCCTGGACATCTCTTAGAGAGGAAT 
GGACCCAGGC T GT CAT TCCAGGAAGAAC TGCAGAGCCT TCAGCC TCT CCAAACATGTAGGAGGAA 
ATGAGGAAATCGCTGTGTTGTTAATGCAGAGANCAAACTCTGTTTAGTTGCAGGGGAAGTTTGGG 
ATATACCCCAAAGTCCTCTACCCCCTCACTTTTATGGCCCTTTCCCTAGATATACTGCGGGATCT 
CTCCTTAGGATAAAGAGTTGCTGTTGAAGTTGTATATTTTTGATCAATATATTTGGAAATTAAAG 
TTTCTGACTTT 
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FIGURE 74 

ML FSALLLEVT W I LAADGGQHWT YEGPHGQDHWPAS YPECGNNAQS P I D IQTDS VT FDPDL PALQ 
PHGYDQPGTEPLDLHNNGHTVQLSLPSTLYLGGLPRKYVAAQLHLHWGQKGSPGGSEHQINSEAT 
FAELHIVHYDSDSYDSLSEAAERPQGLAVLGILIEVGETKNIAYEHILSHLHEVRHKDQKTSVPP 
FNLRELLPKQLGQYFRYNGSLTTPPCYQSVLWTVFYRRSQISMEQLEKLQGTLFSTEEEPSKLLV 
QNYRALQPLNQRMVFASFIQAGSSYTTGEMLSLGVGILVGCLCLLLAVYFIARKIRKKRLENRKS 
WFT S AQAT TEA 

Important features of the protein: 
Signal peptide: 

amino acids 1-15 

Transmembrane domain: 

amino acids 291-310 

N-glycosyla tion si te . 

amino acids 213-216 

Eukaryo tic- type carbonic anhydrases proteins 

amino acids 197-245, 104-140, 22-69 
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FIGURE 75 

TGCCGCTGCCGCCGCTGCTGCTGTTGCTCCTGGCGGCGCCTTGGGGACGGGCAGTTCCCTGTGTC 
TC T GGT GGT T T GCCTAAACCT GCAAACATCACCT T CT TATCCATCAACATGAAGAATCT CC TACA 
AT GGAC TCCACCAGAGGGTCTTCAAGGAGT TAAAGTTACTTACACTGTGCAGTATTTCATCACAA 
ATTGGCCCACCAGAGGTGGCACTGACTACAGATGAGAAGTCCATTTCTGTTGTCCTGACAGCTCC 
AGAGAAGTGGAAGAGAAATCCAGAAGACCTTCCTGTTTCCATGCAACAAATATACTCCAATCTGA 
AGTATAACGTGTCTGTGTTGAATACTAAATCAAACAGAACGTGGTCCCAGTGTGTGACCAACCAC 
ACGCTGGTGCTCACCTGGCTGGAGCCGAACACTCTTTACTGCGTACACGTGGAGTCCTTCGTCCC 
AGGGCCCCCTCGCCGTGCTCAGCCTTCTGAGAAGCAGTGTGCCAGGACTTTGAAAGATCAATCAT 
CAGAGTTCAAGGCTAAAATCATCTTCTGGTATGTTTTGCCCATATCTATTACCGTGTTTCTTTTT 
TCTGTGATGGGCTATTCCATCTACCGATATATCCACGTTGGCAAAGAGAAACACCCAGCAAATTT 
GAT T TT GAT T TAT GGAAATGAAT T T GACAAAAGAT T CT TTGTGCCTGCT GAAAAAAT CGT GAT TA 
ACTTTATCACCCTCAATATCTCGGATGATTCTAAAATTTCTCATCAGGATATGAGTTTACTGGGA 
AAAAGCAGT GAT GTATCCAGCC TTAATGATCC TCAGCCCAGCGGGAACC T GAGGCC CCC TCAGGA 
GGAAGAGGAGGTGAAACATT TAGGGT ATGCT T CGCATT TGAT GGAAAT T T T T T GTGAC TC TGAAG 
AAAACACGGAAGGTAC T TCTC T CACCCAGCAAGAGT CCCTCAGCAGAACAATACCCCCGGATAAA 
ACAGTCATT GAATAT GAATAT GAT GTCAGAACCACTGACAT T T GT GCGGGGCC TGAAGAGCAGGA 
GCT CAGT T TGCAGGAGGAGGT GTCCACACAAGGAACAT TATTGGAGT CGCAGGCAGC GT T GGCAG 
TCTTGGGCCCGCAAACGTTACAGTACTCATACACCCCTCAGCTCCAAGACTTAGACCCCCTGGCG 
CAGGAGCACACAGAC TCGGAGGAGGGGCCGGAGGAAGAGCCAT CGAC GACCC T GGT CGACTGGGA 
TCCCCAAACTGGCAGGCTGTGTATTCCTTCGCTGTCCAGCTTCGAGCAGGATTCAGAGGGCTGCG 
AGCCTTCTGAGGGGGATGGGCTCGGAGAGGAGGGTCTTCTATCTAGACTCTATGAGGAGCCGGCT 
CCAGACAGGCCACCAGGAGAAAATGAAACCTATCTCATGCAATTCATGGAGGAATGGGGGTTATA 
TGTGCAGATGGAAAACTGATGCCAACACTTCCTTTTGCCTTTTGTTTCCTGTGCAAACAAGTGAG 
TCACCCCTTTGATCCCAGCCATAAAGTACCTGGGATGAAAGAAGTTTTTTCCAGTTTGTCAGTGT 
CTGTGAGAATTACTTATTTCTTTTCTCTATTCTCATAGCACGTGTGTGATTGGTTCATGCATGTA 
GGTCTCTTAACAATGATGGTGGGCCTCTGGAGTCCAGGGGCTGGCCGGTTGTTCTATGCAGAGAA 
AGCAGTCAATAAATGTTTGCCAGACTGGGTGCAGAATTTATTCAGGTGGGTGT 
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FIGURE 76 



MSYNGLHQRVFKELKLLTLCSISSQIGPPEVALTTDEKSISWLTAPEKWKRNPEDLPVSMQQIY 
SNLKYNVSVLNTKSNRTWSQCVTNHTLVLTWLEPNTLYCVHVESFVPGPPRRAQPSEKQCARTLK 

DQSSE FKAK 1 1 FWYVL P I S I TVFL FSVMGYS I YRY IHVGKEKHPANL I L I YGNE FDKRFFVPAEK 
IVINFITLNISDDSKISHQDMSLLGKSSDVSSLNDPQPSGNLRPPQEEEEVKHLGYASHLMEIFC 
DSEENTEGTSLTQQESLSRTIPPDKTVIEYEYDVRTTDICAGPEEQELSLQEEVSTQGTLLESQA 
ALAVLGPQTLQYSYTPQLQDLDPLAQEHTDSEEGPEEEPSTTLVDWDPQTGRLCIPSLSSFDQDS 
EGCEPSEGDGLGEEGLLSRLYEEPAPDRPPGENETYLMQFMEEWGLYVQMEN 

Important features: 
Signal peptide: 

amino acids 1-28 

Transmembrane domain: 

amino acids 140-163 

N-glycosylation sites . 

amino acids 71-74, 80-83, 89-92, 204-207, 423-426 
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FIGURE 77 



GAGGAGCGGGCCGAGGACTCCAGCGTGCCCAGGTCTGGCATCCTGCACTTGCTGCCCTCTGACAC 
CTGGGAAGATGGCCGGCCCGTGGACCTTCACCCTTCTCTGTGGTTTGCTGGCAGCCACCTTGATC 
CAAGCCACCCTCAGTCCCACTGCAGTTCTCATCCTCGGCCCAAAAGTCATCAAAGAAAAGCTGAC 
ACAGGAGCTGAAGGACCACAACGCCACCAGCATCCTGCAGCAGCTGCCGCTGCTCAGTGCCATGC 
GGGAAAAGCCAGCCGGAGGCATCCCTGTGCTGGGCAGCCTGGTGAACACCGTCCTGAAGCACATC 
ATCTGGCTGAAGGTCATCACAGCTAACATCCTCCAGCTGCAGGTGAAGCCCTCGGCCAATGACCA 
GGAGCTGCTAGTCAAGATCCCCCTGGACATGGTGGCTGGATTCAACACGCCCCTGGTCAAGACCA 

T CGT GGAGT T CCAC AT GACGACT GAGGCCCAAGCC ACCATC CGCAT GGAC ACC AGT GCAAGTGGC 
CCCACCCGCCTGGTCCTCAGTGACTGTGCCACCAGCCATGGGAGCCTGCGCATCCAACTGCTGTA 
TAAGCTCTCCTTCCTGGTGAACGCCTTAGCTAAGCAGGTCATGAACCTCCTAGTGCCATCCCTGC 
CCAATCTAGTGAAAAACCAGCTGTGTCCCGTGATCGAGGCTTCCTTCAATGGCATGTATGCAGAC 
CTCCTGCAGCTGGTGAAGGTGCCCATTTCCCTCAGCATTGACCGTCTGGAGTTTGACCTTCTGTA 
TCCTGCCATCAAGGGTGACACCATTCAGCTCTACCTGGGGGCCAAGTTGTTGGACTCACAGGGAA 
AGGTGACCAAGTGGTTCAATAACTCTGCAGCTTCCCTGACAATGCCCACCCTGGACAACATCCCG 
TTCAGCCTCATCGTGAGTCAGGACGTGGTGAAAGCTGCAGTGGCTGCTGTGCTCTCTCCAGAAGA 
ATTCATGGTCCTGTTGGACTCTGTGCTTCCTGAGAGTGCCCATCGGCTGAAGTCAAGCATCGGGC 
TGATCAATGAAAAGGCTGCAGATAAGCTGGGATCTACCCAGATCGTGAAGATCCTAACTCAGGAC 
ACTCCCGAGTTTTTTATAGACCAAGGCCATGCCAAGGTGGCCCAACTGATCGTGCTGGAAGTGTT 
TCCCTCCAGTGAAGCCCTCCGCCCTTTGTTCACCCTGGGCATCGAAGCCAGCTCGGAAGCTCAGT 
T TTACACCAAAGGTGACCAACT TATACTCAACTTGAATAACATCAGCTCTGATCGGATCCAGCT G 
ATGAACTC TGGGATTGGCTGGT TCCAACCTGATGTT CTGAAAAACAT CATCACTGAGATCATCCA 
CTCCATCCTGCTGCCGAACCAGAATGGCAAATTAAGATCTGGGGTCCCAGTGTCATTGGTGAAGG 
CCTTGGGATTCGAGGCAGCTGAGTCCTCACTGACCAAGGATGCCCTTGTGCTTACTCCAGCCTCC 
TTGTGGAAACCCAGCTCTCCTGTCTCCCAGTGAAGACTTGGATGGCAGCCATCAGGGAAGGCTGG 
GTCCCAGCTGGGAGTATGGGTGTGAGCTCTATAGACCATCCCTCTCTGCAATCAATAAACACTTG 
CCTGTGAAAAA 



FIGURE 78 



MAGPWTFTLLCGLLAATLIQATLSPTAVLILGPKVIKEKLTQELKDHNATSILQQLPLLSAMREK 
PAGGIPVLGSLWTVLKHIIWLKVITMILQLQWPSMDQELLVKIPLDMVAGFNTPLVKTIVE 
FHMTTEAQATIRMDTSASGPTRLVLSDCATSHGSLRIQLLYKLSFLWALAKQVMNLLVPSLPNL 
VKNQLCPVIEAS FNGMYADLLQLVKVP I SLS IDRLEFDLLYPAIKGDT IQLYLGAKLLDSQGKVT 
KWFNNSAASLTMPTLDNIPFSLIVSQDVVKAAVAAVLSPEEFMVLLDSVLPESAHRLKSSIGLIN 
EKAADKLGSTQIVKILTQDTPEFFIDQGHAKVAQLIVLEVFPSSEALRPLFTLGIEASSEAQFYT 
KGDQLILNLNNISSDRIQLMNSGIGWFQPDVLKNIITEIIHSILLPNQNGKLRSGVPVSLVKALG 

FEAAE S S L T KDALVLT PAS LWKP S S PVS Q 

Important features of the protein: 
Signal peptide: 

amino acids 1-21 

N-glycosylation sites. 

amino acids 48-51, 264-267, 401-404 

Glycosaminoglycan attachment site. 

amino acids 412-415 

LBP / BPI / CETP family proteins. 

amino acids 407-457 
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FIGURE 7Q 



GAGAGAAGT CAGCCT GGCAGAGAGAC TC TGAAAT GAGGGAT TAGAGGTGT T CAAGGAGCAAGAGC 
TTCAGCCTGAAGACAAGGGAGCAGTCCCTGAAGACGCTTCTACTGAGAGGTCTGCCATGGCCTCT 
CTTGGCCTCCAACTTGTGGGCTACATCCTAGGCCTTCTGGGGCTTTTGGGCACACTGGTTGCCAT 
GCTGCTCCCCAGCTGGAAAACAAGTTCTTATGTCGGTGCCAGCATTGTGACAGCAGTTGGCTTCT 
CCAAGGGCC TC T GGAT GGAAT GT GCCACACACAGCACAGGCATCAC CCAGTGT GACATC TATAGC 
ACCCTTCTGGGCCTGCCCGCTGACATCCAGGCTGCCCAGGCCATGATGGTGACATCCAGTGCAAT 
CTCCTCCCTGGCCTGCATTATCTCTGTGGTGGGCATGAGATGCACAGTCTTCTGCCAGGAATCCC 
GAGCCAAAGACAGAGTGGCGGTAGCAGGTGGAGTCTTTTTCATCCTTGGAGGCCTCCTGGGATTC 
ATTCCTGTTGCCTGGAATCTTCATGGGATCCTACGGGACTTCTACTCACCACTGGTGCCTGACAG 
CATGAAATTTGAGATTGGAGAGGCTCTTTACTTGGGCATTATTTCTTCCCTGTTCTCCCTGATAG 
CTGGAATCATCCTCTGCTTTTCCTGCTCATCCCAGAGAAATCGCTCCAACTACTACGATGCCTAC. 
CAAGCCCAACCTCTTGCCACAAGGAGCTCTCCAAGGCCTGGTCAACCTCCCAAAGTCAAGAGTGA 
GTTCAATTCCTACAGCCTGACAGGGTATGTGTGAAGAACCAGGGGCCAGAGCTGGGGGGTGGCTG 
GGTCTGTGAAAAACAGTGGACAGCACCCCGAGGGCCACAGGTGAGGGACACTACCACTGGATCGT 

GT C AGAAGG TGC T GCT GAGGAT AGACTGACT T T GGCCAT TGGAT T GAGCAAAGGCAGAAAT GGGG 
GCTAGTGTAACAGCATGCAGGTTGAATTGCCAAGGATGCTCGCCATGCCAGCCTTTCTGTTTTCC 
TCACCTTGCTGCTCCCCTGCCCTAAGTCCCCAACCCTCAACTTGAAACCCCATTCCCTTAAGCCA 
GGACTCAGAGGATCCCTTTGCCCTCTGGTTTACCTGGGACTCCATCCCCAAACCCACTAATCACA 
TCCCACTGACTGACCCTCTGTGATCAAAGACCCTCTCTCTGGCTGAGGTTGGCTCTTAGCTCATT 
GCTGGGGATGGGAAGGAGAAGCAGTGGCTTTTGTGGGCATTGCTCTAACCTACTTCTCAAGCTTC 
CCT CCAAAGAAACT GAT T GGCCC T GGAACC TCCAT C CCAC T C T TGT TAT GAC T CCACAGT GT CCA 
GACTAATT TGTGCAT GAACTGAAATAAAACCATCCTACGGTATCCAGGGAACAGAAAGCAGGATG 
CAGGATGGGAGGACAGGAAGGCAGCC TGGGACAT T TAAAAAAAT A 



FIGURE 80 



MASLGLQLVGYILGLLGLLGTLVAMLLPSWKTSSYVGASIVTAVGFSKGLWMECATHSTGITQCD 

I YSTLLGLPADI QAAQAMMVT SSAI S SLAC 1 1 S WGMRCT VFCQESRAKDRVAVAGGVFF ILGGL 
LGFIPVAWNLHGILRDFYSPLVPDSMKFEIGEALYLGIISSLFSLIAGIILCFSCSSQRNRSNYY 

DAYQAQPLATRSSPRPGQPPKVKSEFNSYSLTGYV 

Important features of the protein: 
Signal peptide: 

amino acids 1-24 

Transmembrane domains: 

amino acids 82-102, 117-140, 163-182 

N-glycosylation site. 

amino acids 190-193 

PMP-22 / EMP / MP20 family proteins. 

amino acids 46-59 
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FIGURE 81 

CCCACGCGTCCGCGCCTCTCCCTTCTGCTGGACCTTCCTTCGTCTCTCCATCTCTCCCTCCTTTC 
CCCGCGTTCTCTTTCCACCTTTCTCTTCTTCCCACCTTAGACCTCCCTTCCTGCCCTCCTTTCCT 
GCCCACCGCTGCTTCCTGGCCCTTCTCCGACCCCGCTCTAGCAGCAGACCTCCTGGGGTCTGTGG 
GTTGATCTGTGGCCCCTGTGCCTCCGTGTCCTTTTCGTCTCCCTTCCTCCCGACTCCGCTCCCGG 
ACCAGCGGCCTGACCCTGGGGAAAGGATGGTTCCCGAGGTGAGGGTCCTCTCCTCCTTGCTGGGA 
CTCGCGCTGCTCTGGTTCCCCCTGGACTCCCACGCTCGAGCCCGCCCAGACATGTTCTGCCTTTT 
j** CCATGGGAAGAGATACTCCCCCGGCGAGAGCTGGCACCCCTACTTGGAGCCACAAGGCCTGATGT 
W ACTGCCTGCGCTGTACCTGCTCAGAGGGCGCCCATGTGAGTTGTTACCGCCTCCACTGTCCGCCT 

m 

GTCCACTGCCCCCAGCCTGTGACGGAGCCACAGCAATGCTGTCCCAAGTGTGTGGAACCTCACAC 
Zl TCCCTCTGGACTCCGGGCCCCACCAAAGTCCTGCCAGCACAACGGGACCATGTACCAACACGGAG 
HI AGATCTTCAGTGCCCATGAGCTGTTCCCCTCCCGCCTGCCCAACCAGTGTGTCCTCTGCAGCTGC 
l8 ACAGAGGGCCAGATCTACTGCGGCCTCACAACCTGCCCCGAACCAGGCTGCCCAGCACCCCTCCC 
W ACTGCCAGACTCCTGCTGCCAAGCCTGCAAAGAT GAGGCAAGT GAGCAATCGGATGAAGAGGACA 

GTGTGCAGTCGCTCCATGGGGTGAGACATCCTCAGGATCCATGTTCCAGTGATGCTGGGAGAAAG 
||j AGAGGCCCGGGCACCCCAGCCCCCACTGGCCTCAGCGCCCCTCTGAGCTTCATCCCTCGCCACTT 
O CAGACCCAAGGGAGCAGGCAGCACAACT GTCAAGAT CGTCC T GAAGGAGAAACATAAGAAAGCC T 

GTGTGCATGGCGGGAAGACGTACTCCCACGGGGAGGTGTGGCACCCGGCCTTCCGTGCCTTCGGC 
CCCTTGCCCTGCATCCTATGCACCTGTGAGGATGGCCGCCAGGACTGCCAGCGTGTGACCTGTCC 
CACCGAGTACCCCTGCCGTCACCCCGAGAAAGTGGCTGGGAAGTGCTGCAAGATTTGCCCAGAGG 
ACAAAGCAGACCCTGGCCACAGTGAGATCAGTTCTACCAGGTGTCCCAAGGCACCGGGCCGGGTC 
CTCGTCCACACATCGGTATCCCCAAGCCCAGACAACCTGCGTCGCTTTGCCCTGGAACACGAGGC 
CTCGGACTTGGTGGAGATCTACCTCTGGAAGCTGGTAAAAGATGAGGAAACTGAGGCTCAGAGAG 
G T GAAGT AC C T GGCCCAAGGCC ACACAGCCAGAAT C T T CCACT TGACTCAGATCAAGAAAG TCAG 
GAAGCAAGAC T TCCAGAAAGAGGCACAGCAC T TCCGAC T GC TCGCT GGCCCCCACGAAGGTCACT 
GGAACGTC TT CCTAGCCCAGACCCT GGAGCTGAAGGTCACGGCCAG T CCAGACAAAGT GACCAAG 
ACATAACAAAGACC TAA CAGT TGCAGATATGAGCT GTATAAT T GTT GT TAT TAT AT AT TAATAAA 
TAAGAAG T TGCAT TACC C T CAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 82 

MVPEVRVLSSLLGLALLWFPLDSHARARPDMFCLFHGKRYSPGESWHPYLEPQGLMYCLRCTCSE 
GAHVSCYRLHCPPVHCPaPVTEPQQCCPKCVEPHTPSGLRAPPKSCQHNGTMYQHGEIFSAHELF 
PSRLPNQCVLCSCTEGQIYCGLTTCPEPGCPAPLPLPDSCCQACKDEASEQSDEEDSVQSLHGVR 
HPQDPCSSDAGRKRGPGTPAPTGLSAPLSFIPRHFRPKGAGSTTVKIVLKEKHKKACVHGGKTYS 
HGEVWHPAFRAFGPLPCILCTCEDGRQDCQRVTCPTEYPCRHPEKVAGKCCKICPEDKADPGHSE 
ISSTRCPKAPGRVLVHTSVSPSPDNLRRFALEHEASDLVEIYLWKLVKDEETEAQRGEVPGPRPH 
SQNLPLDSDQESQEARLPERGTALPTARWPPRRSLERLPSPDPGAEGHGQSRQSDQDITKT 

Signal peptide: 

amino acids 1-25 



APP_ID=10063592 



Page 232 



FIGURE 83 



GACAGCT GT GTCT C GAT GGAGT AGACTCTC AGAACAGCGC AGTTTGCCCTCCGCT C AC GC AG AGCCTCTCC 
GTGGCTTCCGCACCTTGAGCATTAGGCCAGTTCTCCTCTTCTCTCTAATCCATCCGTCACCTCTCCTGTCA 
TCCGTTTCCATGCCGTGAGGTCCATTCACAGAACACATCCATGGCTCTCATGCTCAGTTTGGTTCTGAGTC 
TCCTCAAGCTGGGATCAGGGCAGTGGCAGGTGTTTGGGCCAGACAAGCCTGTCCAGGCCTTGGTGGGGGAG 

GACGC AGC AT T CTCCT GTT T CC TGTCTCCT AAGACCAATGCAGAGGCC AT GGAAGTGC GGTTCT TC AGGG G 
CCAGTTCTCTAGCGTGGTCCACCTCTACAGGGACGGGAAGGACCAGCCATTTATGCAGATGCCACAGTATC 
AAGGC AGGACAAAACT GGT GAAGGAT TCT ATT GCGG AGGGGCGC AT CTCT CT GAGGCT GGAAAACATTACT 
GT GTT GGATGCT GGCCT CT ATGGGTGCAGG AT TAGTT C CC AGTCTT ACT ACC AG AAGGCC AT CTGGGAGC T 
AC AGGTGT CAGC AC TGGGCT CAGTT CCT CTCATT T CCATC ACGGGAT AT GTT GAT AGAGAC AT C CAGCT AC 
TCTGTCAGTCCTCGGGCTGGTTCCCCCGGCCCACAGCGAAGTGGAAAGGTCCACAAGGACAGGATTTGTCC 
ACAGACTCCAGGACAAACAGAGACATGCATGGCCTGTTTGATGTGGAGATCTCTCTGACCGTCCAAGAGAA 
CGCCGGGAGCATATCCTGTTCCATGCGGCATGCTCATCTGAGCCGAGAGGTGGAATCCAGGGTACAGATAG 
GAGATACCTTTTTCGAGCCTATATCGTGGCACCTGGCTACCAAAGTACTGGGAATACTCTGCTGTGGCCTA 
TT TTTTGG CATT GT TGGACT GAAG AT TTTCTT CTCC AAAT TCCAGT GGAAAATCCAGGCGGAACT GGACT G 
GAGAAG AAAGCACGGACAGGCAGAAT TGAGAG ACGCCCGG AAAC ACGCAGTGGAGGTGACT C TGGAT CCAG 
AG ACGGCTCACCCGAAGCTC TGCGTT TCTGAT CTGAAAACT GTAACCC AT AGAAAAGC TCCCCAGGAGGT G 
CCTCACTCTGAGAAGAGATT TACAAGGAAGAGTGT GGT GGCTTC TCAGAGTT TCC AAGCAGGGAAACATT A 
CTGGGAGGTGGACGGAGGACACAATAAAAGGTGGCGCGTGGGAGTGTGCCGGGATGATGTGGACAGGAGGA 
AGGAGT ACGTGACTTTGTCT CCCGAT CATGGGTACTGGGTCCTC AGACTGAATGGAGAACATTTGTATTT C 
ACATTAAATCCCCGTTTTATCAGCGTCTTCCCCAGGACCCCACCTACAAAAATAGGGGTCTTCCTGGACTA 
TGAGTGTGGGAC C ATCT CCTTCT T CAAC AT AAATGACC AGTCCC TT ATTT AT ACCC TGAC AT GT CGGT TT G 
AAGGCT T ATT GAGGCCCT AC ATTG AGT AT C CGTCCT AT AATG AGC AAAAT GGAACT CCCAT AGT CATC TGC 
CCAGTCACCCAGGAATCAGAGAAAGAGGCCTCTTGGCAAAGGGCCTCTGCAATCCCAGAGACAAGCAACAG 
TGAGTCCTCCTCACAGGCAACCACGCCCTTCCTCCCCAGGGGTGAAATGTAGGATGAATCACATCCCACAT 
TCTTCTTTAGGGATATTAAGGTCTCTCTCCCAGATCCAAAGTCCCGCAGCAGCCGGCCAAGGTGGCTTCCA 
GATGAAGGGGGACTGGCCTGTCCACATGGGAGTCAGGTGTCATGGCTGCCCTGAGCTGGGAGGGAAGAAGG 
CTGACATTACATTTAGTTTGCTCTCACTCCATCTGGCTAAGTGATCTTGAAATACCACCTCTCAGGTGAAG 
AACCGTCAGGAATTCCCATCTCACAGGCTGTGGTGTAGATTAAGTAGACAAGGAATGTGAATAATGCTTAG 
ATCT T ATTGAT GAC AGAGTGT AT C CT AATGGT TT GT TC ATT ATATT AC ACTTTC AGT AAAAAAA 



FIGURE 8a 

MALMLSLVLSLLKLGSGQWQVFGPDKPVQALVGEDAAFSCFLSPKTNAEAMEVRFFRGQFSSVVH 
LYRDGKDQPFMQMPQYQGRTKLVKDSIAEGRISLRLENITVLDAGLYGCRISSQSYYQKAIWELQ 
VSALGS VPL I S I TGYVDRD I QLLCQS SGWFPRPT AKWKGPQGQDLSTDSRTNRDMHGL FDVE I S L 
TVQENAGS I S CSMRHAHLS REVE SRVQI GDT FFE P I SWHLATKVLG I LCCGL FFG I VGLKI FFSK 
FQWKIQAELDWRRKHGQAELRDARKHAVEVTLDPETAHPKLCVSDLKTVTHRKAPQEVPHSEKRF 
TRKSWASQSFQAGKHYWEVDGGHNKRWRVGVCRDDVDRRKEYVTLSPDHGYWVLRLNGEHLYFT 
LNPRFISVFPRTPPTKIGVFLDYECGTISFFNINDQSLIYTLTCRFEGLLRPYIEYPSYNEQNGT 
PI VI CP VTQE SEKE AS WQRAS AI PETSNSESSS QATT P FL PRGEM 

Signal peptide: 

amino acids 1-17 

Transmembrane domain: 

amino acids 239-255 



APP_ID=1 0063592 



Page 234 



FIGURE 8* 



AACAGACGTTCCCTCGCGGCCCTGGCACCTCTAACCCCAGACATGCTGCTGCTGCTGCTGCCCCT 

GCTCT GGGGGAGGGAGAGGGCGGAAGGACAGACAAGTAAAC TGCT GACGAT GCAGAGT TCCGTGA 

CGGTGCAGGAAGGCCTGTGTGTCCATGTGCCCTGCTCCTTCTCCTACCCCTCGCATGGCTGGATT 

TACCCTGGCCCAGTAGTTCATGGCTACTGGTTCCGGGAAGGGGCCAATACAGACCAGGATGCTCC 

AGTGGCCACAAACAACCCAGCTCGGGCAGTGTGGGAGGAGACTCGGGACCGATTCCACCTCCTTG 

GGGACCCACATACCAAGAAT TGCACCC T GAGCAT CAGAGAT GCCAGAAGAAGT GATGC GGGGAGA 

TACTTCTTTCGTATGGAGAAAGGAAGTATAAAATGGAATTATAAACATCACCGGCTCTCTGTGAA 

TGTGACAGCCTTGACCCACAGGCCCAACATCCTCATCCCAGGCACCCTGGAGTCCGGCTGCCCCC 

AGAATCTGACCTGCTCTGTGCCCTGGGCCTGTGAGCAGGGGACACCCCCTATGATCTCCTGGATA 

GGGACCTCCGTGTCCCCCCTGGACCCCTCCACCACCCGCTCCTCGGTGCTCACCCTCATCCCACA 

GCCCCAGGACCATGGCACCAGCCTCACCTGTCAGGTGACCTTCCCTGGGGCCAGCGTGACCACGA 

ACAAGACCGTCCATCTCAACGTGTCCTACCCGCCTCAGAACTTGACCATGACTGTCTTCCAAGGA 

GACGGCAGAGTATCCACAGTCTTGGGAAATGGCTCATCTCTGTCACTCCCAGAGGGCCAGTCTCT 

GCGCCTGGTCTGTGCAGTTGATGCAGTTGACAGCAATCCCCCTGCCAGGCTGAGCCTGAGCTGGA 

GAGGCCTGACCCTGTGCCCCTCACAGCCCTCAAACCCGGGGGTGCTGGAGCTGCCTTGGGTGCAC 

CTGAGGGATGCAGCTGAATTCACCTGCAGAGCTCAGAACCCTCTCGGCTCTCAGCAGGTCTACCT 

GAACGTCTCCCTGCAGAGCAAAGCCACATCAGGAGTGACTCAGGGGGTGGTCGGGGGAGCTGGAG 

CCACAGCCCTGGTCTTCCTGTCCTTCTGCGTCATCTTCGTTGTAGTGAGGTCCTGCAGGAAGAAA 

TCGGCAAGGCCAGCAGCGGGCGTGGGAGATACGGGCATAGAGGATGCAAACGCTGTCAGGGGTTC 

AGCCTCTCAGGGGCCCCTGACTGAACCTTGGGCAGAAGACAGTCCCCCAGACCAGCCTCCGCCAG 

CTTCTGCCCGCTCCTCAGTGGGGGAAGGAGAGCTCCAGTATGCATCCCTCAGCTTCCAGATGGTG 

AAGCCT T GGGACT CGCGGGGACAGGAGGCCAC TGACACCGAGTACT CGGAGAT CAAGATCCACAG 

ATGAGAAACT GCAG AGAC TCACCCT GAT TGAGGGAT CACAGCCCCTCCAGGC AAGGGAGAAGT CA 

GAGGCTGATTCTTGTAGAATTAACAGCCCTCAACGTGATGAGCTATGATAACACTATQAATTATG 

T GCAGAGT GAAAAGCACACAGGCT T T AGAGTCAAAGTATC TCAAAC CTGAATCCACACTGT GCCC 

T CCCT T T TAT T TT T T TAAC TAAAAGACAGACAAAT T CCTA 



FIGURE 86 



MLLLLLPLLWGRERAEGQTSKLLTMQSSVTVQEGLCVHVPCSFSYPSHGWIYPGPWHGYWFREG 
ANTDQDAPVATNNPARAVWEETRDRFHLLGDPHTKNCTLSIRDARRSDAGRYFFRMEKGSIKWNY 
KHHRLSVNVTALTHRPNILIPGTLESGCPQNLTCSVPWACEQGTPPMISWIGTSVSPLDPSTTRS 
SVLTLIPQPQDHGTSLTCQVTFPGASVTTNKTVHLNVSYPPQNLTMTVFQGDGTVSTVLGNGSSL 
SLPEGQSLRLVCAVDAVDSNPPARLSLSWRGLTLCPSQPSNPGVLELPWVHLRDAAEFTCRAQNP 
LGSQQVYLNVSLQSKATSGVTQGWGGAGATALVFLS FCVI FVWRSCRKKSARPAAGVGDTGIE 
H DANAVRGSASQGPLTEPWAEDSPPDQPPPASARSSVGEGELQYASLSFQMVKPWDSRGQEATDTE 

:J? YSEIKIHR 

||| Signal peptide: 

III amino acids 1-15 

m 

Transmembrane domain: 

P amino acids 351-370 

If : 

?£; 
5 1"? 
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FIGURE 87 

AGAAAGCTGCACTCTGTTGAGCTCCAGGGCGCAGTGGAGGGAGGGAGTGAAGGAGCTCTCTGTAC 
CCAAGGAAAGTGCAGCTGAGACTCAGACAAGATTACAATGAACCAACTCAGCTTCCTGCTGTTTC 
T CATAGCGACCACCAGAGGATGGAGT ACAGAT GAGGC T AAT ACT TACT T CAAGGAAT GGACC T GT 
TCTTCGTCTCCATCTCTGCCCAGAAGCTGCAAGGAAATCAAAGACGAATGTCCTAGTGCATTTGA 
TGGCCTGTATTTTCTCCGCACTGAGAATGGTGTTATCTACCAGACCTTCTGTGACATGACCTCTG 
GGGGTGGCGGCTGGACCCTGGTGGCCAGCGTGCATGAGAATGACATGCGTGGGAAGTGCACGGTG 
GGCGATCGCTGGTCCAGTCAGCAGGGCAGCAAAGCAGACTACCCAGAGGGGGACGGCAACTGGGC 
CAACTACAACACCTTTGGATCTGCAGAGGCGGCCACGAGCGATGACTACAAGAACCCTGGCTACT 
ACGACATCCAGGCCAAGGACCTGGGCATCTGGCACGTGCCCAATAAGTCCCCCATGCAGCACTGG 
AGAAACAGCTCCCTGCTGAGGTACCGCACGGACACTGGCTTCCTCCAGACACTGGGACATAATCT 
GTTTGGCATCTACCAGAAATATCCAGTGAAATATGGAGAAGGAAAGTGTTGGACTGACAACGGCC 
CGGTGATCCCTGTGGTCTATGATTTTGGCGACGCCCAGAAAACAGCATCTTATTACTCACCCTAT 
GGCCAGCGGGAATTCACTGCGGGATTTGTTCAGTTCAGGGTATTTAATAACGAGAGAGCAGCCAA 
CGCCTTGTGTGCTGGAATGAGGGTCACCGGATGTAACACTGAGCATCACTGCATTGGTGGAGGAG 
GAT ACT T TCCAGAGGCCAGTCCCCAGCAGT GT GGAGAT TTTTCTGGTTT TGAT TGGAGT GGATAT 
GGAACTCATGTTGGTTACAGCAGCAGCCGTGAGATAACTGAGGCAGCTGTGCTTCTATTCTATCG 
T TGA GAGTT T TGT GGGAGGGAACCCAGACCTCTCC TCCCAACCATGAGATCCCAAGGAT GGAGAA 
CAACT TACCC AGT AGC T AGAATGT T AATGGCAGAAGAGAAAAC AAT AAAT CATAT T GAC T CAAG A 
AAAAAA 
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FIGURE 88 



MNQLSFLLFLIATTRGWSTDEANTYFKEWTCSSSPSLPRSCKEIKDECPSAFDGLYFLRTENGVI 
YQTFCDMTSGGGGWTLVASVHENDMRGKCTVGDRWSSQQGSKADYPEGDGNWANYNTFGSAEAA.T 
SDDYKNPGYYDIQAKDLGIWHVPNKSPMQHWRNSSLLRYRTDTGFLQTLGHNLFGIYQKYPVKYG 
EGKCWTDNGPVIPWYDFGDAQKTASYYSPYGQREFTAGFVQFRVFNNERAANALCAGMRVTGCN 
TEHHCI GGGGYFPEAS PQQCGDFS G FDWSGYGTHVGYS S S RE I TEAAVLLFYR 

Important features: 
Signal peptide: 

amino acids 1-16 

N-glycosylation site. 

amino acids 163-167 

Glycosaminoglycan attachment sites . 

amino acids 74-78, 289-293 

N-myristoylation sites. 

amino acids 76-82, 115-121, 124-130, 253-259, 292-298 
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FIGURE 8q 



If! 
St?* 

ri 

lit 



CTAGAT T T GT CGGC T T GCGGGGAGACT TCAGGAGT CGC T GT C T CT GAAC T TCCAGCC TCAGAGAC 
CGCCGCCCTTGTCCCCGAGGGCCATGGGCCGGGTCTCAGGGCTTGTGCCCTCTCGCTTCCTGACG 
CTCCTGGCGCATCTGGTGGTCGTCATCACCTTATTCTGGTCCCGGGACAGCAACATACAGGCCTG 
CCTGCCTCTCACGTTCACCCCCGAGGAGTATGACAAGCAGGACATTCAGCTGGTGGCCGCGCTCT 
CTGTCACCCTGGGCCTCTTTGCAGTGGAGCTGGCCGGTTTCCTCTCAGGAGTCTCCATGTTCAAC 
AGCACCCAGAGCCTCATCTCCATTGGGGCTCACTGTAGTGCATCCGTGGCCCTGTCCTTCTTCAT 
ATTCGAGCGTTGGGAGTGCACTACGTATTGGTACATTTTTGTCTTCTGCAGTGCCCTTCCAGCTG 
TCACTGAAATGGCTTTATTCGTCACCGTCTTTGGGCTGAAAAAGAAACCCTTC TGA TTACCTTCA 
TGACGGGAACCTAAGGACGAAGCCTACAGGGGCAAGGGCCGCTTCGTATTCCTGGAAGAAGGAAG 
GCATAGGCTTCGGTTTTCCCCTCGGAAACTGCTTCTGCTGGAGGATATGTGTTGGAATAATTACG 
TCTTGAGTCTGGGATTATCCGCATTGTATTTAGTGCTTTGTAATAAAATATGTTTTGTAGTAACA 
T T AAGAC T T AT AT AC AGT T T T AGG GGACAAT T AAAAAAAAAAAA 
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FIGURE qo 

MGRVSGLVPSRFLTLLAHLWVITLFWSRDSNIQACLPLTFTPEEYDKQDIQLVAALSVTLGLFA 
VELAGFLSGVSMFNSTQSL I S I GAHCSASVALS FFI FERWECTTYWYI FVFCSALPAVTEMALFV 

TVFGLKKKPF 

Transmembrane domain: 

amino acids 12-28 (type II), 51-66, 107-124 



V 
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FIGURE Ql 



C T G GGACC C CGAAAAGAGAAGG GGAGAGCGAG G GGACGAGAG C GGAGGAGGAAGATGC AAC T GAC 

TCGCTGCTGCTTCGTGTTCCTGGTGCAGGGTAGCCTCTATCTGGTCATCTGTGGCCAGGATGATG 

GTCCTCCCGGCTCAGAGGACCCTGAGCGTGATGACCACGAGGGCCAGCCCCGGCCCCGGGTGCCT 

CGGAAGCGGGGCCACATCTCACCTAAGTCCCGCCCCATGGCCAATTCCACTCTCCTAGGGCTGCT 

GGCCCCGCCTGGGGAGGCTTGGGGCATTCTTGGGCAGCCCCCCAACCGCCCGAACCAGAGCCCCC 

CACCCTCAGCCAAGGTGAAGAAAATCTTTGGCTGGGGCGACTTCTACTCCAACATCAAGACGGTG 

GCCCTGAACCTGCTCGTCACAGGGAAGATTGTGGACCATGGCAATGGGACCTTCAGCGTCCACTT 

CCAACAC AAT GCCACAGGCCAGGGAAACATC T C CAT CAGCC T CGT GCCCCCCAGT AAAGC TGTAG 

AGT TCCACCAGGAACAGCAGATC T TCATCGAAGCCAAGGCC T CCAAAAT C TT CAACT GCCGGAT G 

GAGTGGGAGAAGGTAGAACGGGGCCGCCGGACCT CGCT T TGCACCCACGACCCAGCGAAGATC T G 

CTCCCGAGACCACGCTCAGAGCTCAGCCACCTGGAGCTGCTCCCAGCCCTTCAAAGTCGTCTGTG 

TCTACATCGCCTTCTACAGCACGGACTATCGGCTGGTCCAGAAGGTGTGCCCAGATTACAACTAC 

CATAGTGATACCCCCTACTACCCATCTGGGTGACCCGGGGCAGGCCACAGAGGCCAGGCCAGGGC 

TGGAAGGACAGGCCTGCCCATGCAGGAGACCATCTGGACACCGGGCAGGGAAGGGGTTGGGCCTC 

AGGCAGGGAGGGGGGTGGAGACGAGGAGATGCCAAGTGGGGCCAGGGCCAAGTCTCAAGTGGCAG 

AGAAAGGGTCCCAAGTGCTGGTCCCAACCTGAAGCTGTGGAGTGACTAGATCACAGGAGCACTGG 

AGGAGGAGTGGGCTCTCTGTGCAGCCTCACAGGGCTTTGCCACGGAGCCACAGAGAGATGCTGGG 

TCCCCGAGGCCTGTGGGCAGGCCGATCAGTGTGGCCCCAGATCAAGTCATGGGAGGAAGCTAAGC 

CCT T GGT TCT TGCCATCC TGAGGAAAGATAGCAACAGGGAGGGGGAGAT T TCAT CAGTGTGGACA 

GCCTGTCAACTTAGGATGGATGGCTGAGAGGGCTTCCTAGGAGCCAGTCAGCAGGGTGGGGTGGG 

GCCAGAGGAGCTCTCCAGCCCTGCCTAGTGGGCGCCCTGAGCCCCTTGTCGTGTGCTGAGCATGG 

CATGAGGCTGAAGTGGCAACCCTGGGGTCTTTGATGTCTTGACAGATTGACCATCTGTCTCCAGC 

CAGGCCACCCCTTTCCAAAATTCCCTCTTCTGCCAGTACTCCCCCTGTACCACCCATTGCTGATG 

GCACACCCAT CCT TAAGCTAAGACAGGACGAT TGTGGT CC TCC CACAC TAAGGCCACAGCC CAT C 

CGCGTGCTGTGTGTCCCTCTTCCACCCCAACCCCTGCTGGCTCCTCTGGGAGCATCCATGTCCCG 

GAGAGGGGTCCCTCAACAGTCAGCCTCACCTGTCAGACCGGGGTTCTCCCGGATCTGGATGGCGC 

CGCCCTCTCAGCAGCGGGCACGGGTGGGGCGGGGCCGGGCCGCAGAGCATGTGCTGGATCTGTTC 

TGTGTGTCTGTCTGTGGGTGGGGGGAGGGGAGGGAAGTCTTGTGAAACCGCTGATTGCTGACTTT 

TGTGTGAAGAATCGTGTTCTTGGAGCAGGAAATAAAGCTTGCCCCGGGGCA 



FIGURE Q2 

MQLTRCCFVFLVQGSLYLVICGQDDGPPGSEDPERDDHEGQPRPRVPRKRGHISPKSRPMANSTL 
LGLLAPPGEAWGILGQPPNRPNHSPPPSAKVKKIFGWGDFYSNIKTVALNLLVTGKIVDHGNGTF 
STOFQHNATGQGNISISLVPPSKAVEFHQEQQIFIEAKASKIFNCRMEWEKVERGRRTSLCTHDP 
AKICSRDHAQSSATWSCSQPFKWCVYIAFYSTDYRLVQKVCPDYNYHSDTPYYPSG 

Important features of the protein: 
Signal peptide: 

amino acids 1-14 

N-glycosylation sites. 

amino acids 62-65, 127-130, 137-140, 143-146 

2-oxo acid dehydrogenases acyl transferase 

amino acids 61-71 
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FIGURE Q3 

CGGTGGCCATGACTGCGGCCGTGTTCTTCGGCTGCGCCTTCATTGCCTTCGGGCCTGCGCTCGCC 
CTTTATGTCTTCACCATCGCCATCGAGCCGTTGCGTATCATCTTCCTCATCGCCGGAGCTTTCTT 
CTGGTTGGTGTCTCTACTGATTTCGTCCCTTGTTTGGTTCATGGCAAGAGTCATTATTGACAACA 
AAGATGGACCAACACAGAAATATCTGCTGATCTTTGGAGCGTTTGTCTCTGTCTATATCCAAGAA 
ATGTTCCGATTTGCATATTATAAACTCTTAAAAAAAGCCAGTGAAGGTTTGAAGAGTATAAACCC 
AGGTGAGACAGCACCCTCTATGCGACTGCTGGCCTATGTTTCTGGCTTGGGCTTTGGAATCATGA 
GTGGAGTATTTTCCTTTGTGAATACCCTATCTGACTCCTTGGGGCCAGGCACAGTGGGCATTCAT 
GGAGATTCTCCTCAATTCTTCCTTTATTCAGCTTTCATGACGCTGGTCATTATCTTGCTGCATGT 
ATTCTGGGGCATTGTATTTTTTGATGGCTGTGAGAAGAAAAAGTGGGGCATCCTCCTTATCGTTC 
TCCTGACCCACCTGCTGGTGTCAGCCCAGACCTTCATAAGTTCTTATTATGGAATAAACCTGGCG 
TCAGCATTTATAATCCTGGTGCTCATGGGCACCTGGGCATTCTTAGCTGCGGGAGGCAGCTGCCG 
AAGCCTGAAACTGTGCCTGCTCTGCCAAGACAAGAACTTTCTTCTTTACAACCAGCGCTCCAGAT 
AACCTCAGGGAACCAGCAC T TCCCAAACCGCAGACTACATC TT TAGAGGAAGCACAACT GTGCCT 
TTTTCTGAAAATCCCTTTTTCTGGTGGAATTGAGAAAGAAATAAAACTATGCAGATA 
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FIGURE QA 



MTAAVFFGCAFI AFGPALAL YVFT I AIEPLRI I FL I AGAFFWLVSLL I S SLVW FMARVI I DNKDG 
PTQKYLLIFGAFVSVYIQEMFRFAYYKLLKKASEGLKSINPGETAPSMRLLAYVSGLGFGIMSGV 
FSFVNTLSDSLGPGTVGIHGDSPQFFLYSAFMTLVIILLHVFWGIVFFDGCEKKKWGILLIVLLT 
HLLVSAQTFISSYYGINLASAFIILVLMGTWAFLAAGGSCRSLKLCLLCQDKNFLLYNQRSR 

Important features of the protein: 
Signal peptide: 

amino acids 1-19 



U\ Transmembrane domains: 



amino acids 32-51, 119-138, 152-169, 216-235 

Glycosaminoglycan attachment site. 

amino acids 120-123 

Sodium: neurotransmitter symporter family protein 

amino acids 31-65 
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FIGURE QS 



AATT TT TCACCAGAGTAAAC TT GAGAAACCAACT GGAC CT T GAG TAT TGTACAT TTTGCCTCGTG 
GAC CCAAAGGT AGCAAT C T GAAACATGAGGAGT ACGAT TC TACT GT T T TG TC T T CTAGGAT C AAC 
TCGGTCATTACCACAGCTCAAACCTGCTTTGGGACTCCCTCCCACAAAACTGGCTCCGGATCAGG 
GAACACTACCAAACCAACAGCAGTCAAATCAGGTCTT TCCT TCTT TAAGTCTGATACCATTAACA 
CAGATGCTCACACTGGGGCCAGATCTGCATCTGTTAAATCCTGCTGCAGGAATGACACCTGGTAC 
CCAGACCCACCCATTGACCCTGGGAGGGTTGAATGTACAACAGCAACTGCACCCACATGTGTTAC 
CAATTTTTGTCACACAACTTGGAGCCCAGGGCACTATCCTAAGCTCAGAGGAATTGCCACAAATC 
TTCACGAGCCTCATCATCCATTCCTTGTTCCCGGGAGGCATCCTGCCCACCAGTCAGGCAGGGGC 
TAATCCAGATGTCCAGGATGGAAGCCTTCCAGCAGGAGGAGCAGGTGTAAATCCTGCCACCCAGG 
GAACCCCAGCAGGCCGCCTCCCAACTCCCAGTGGCACAGATGACGACTTTGCAGTGACCACCCCT 
GCAGGCATCCAAAGGAGCACACATGCCATCGAGGAAGCCACCACAGAATCAGCAAATGGAATTCA 
GTAAGCTGTTTCAAATTTTTTCAACTAAGCTGCCTCGAATTTGGTGATACATGTGAATCTTTATC 
ATTGATTATATTATGGAATAGATTGAGACACATTGGATAGTCTTAGAAGAAATTAATTCTTAATT 
TACCT GAAAAT AT TCT T GAAAT TT CAGAAAAT AT GT TC TAT GTAGAGAATCCCAAC T T T TAAAAA 
CAATAATTCAATGGATAAATCTGTCTTTGAAATATAACATTATGCTGCCTGGATGATATGCATAT 
TAAAACATAT TT GGAAAACTGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE Q6 

MRST.ILLFCLLGSTRSLPQLKPALGLPPTKLAPDQGTLPNQQQSNQVFPSLSLIPLTQM 
LTLGPDLHLLNPAAGMTPGTQTHPLTLGGLNVQQQLHPHVLPIFVTQLGAQGTILSSEE 
LPQIFTSLIIHSLFPGGILPTSQAGANPDVQDGSLPAGGAGVNPATQGTPAGRLPTPSG 
TDDDFAVTTPAGIQRSTHAIEEATTESANGIQ 



3 SS 

ill 

HI 



Signal peptide: 

amino acids 1-16 
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FIGURE Q7 



GCTCAAGTGCCCTGCCTTGCCCCACCCAGCCCAGCCTGGCCAGAGCCCCCTGGAGAAGGAGCTCT 
CTTCTTGCTTGGCAGCTGGACCAAGGGAGCCAGTCTTGGGCGCTGGAGGGCCTGTCCTGACCATG 
GTCCCTGCCTGGCTGTGGCTGCTTTGTGTCTCCGTCCCCCAGGCTCTCCCCAAGGCCCAGCCTGC 
AGAGCTGTCTGTGGAAGTTCCAGAAAACTATGGTGGAAATTTCCCTTTATACCTGACCAAGTTGC 
CGCTGCCCCGTGAGGGGGCTGAAGGCCAGATCGTGCTGTCAGGGGACTCAGGCAAGGCAACTGAG 
GGCCCATTTGCTATGGATCCAGATTCTGGCTTCCTGCTGGTGACCAGGGCCCTGGACCGAGAGGA 
GCAGGCAGAGTACCAGCTACAGGTCACCCTGGAGATGCAGGATGGACATGTCTTGTGGGGTCCAC 
AGCCTGTGCTTGTGCACGTGAAGGATGAGAATGACCAGGTGCCCCATTTCTCTCAAGCCATCTAC 
AGAGCTCGGCTGAGCCGGGGTACCAGGCCTGGCATCCCCTTCCTCTTCCTTGAGGCTTCAGACCG 
\*k GGATGAGCCAGGCACAGCCAACT CGGATCT T CGAT TCCACAT CC T GAGCCAGGCTCCAGCCCAGC 

f*| CTTCCCCAGACATGTTCCAGCTGGAGCCTCGGCTGGGGGCTCTGGCCCTCAGCCCCAAGGGGAGC 
i{ ACCAGCCTTGACCACGCCCTGGAGAGGACCTACCAGCTGTTGGTACAGGTCAAGGACATGGGTGA 
|jf CCAGGCCTCAGGCCACCAGGCCACTGCCACCGTGGAAGTCTCCATCATAGAGAGCACCTGGGTGT 
CCCTAGAGCCTATCCACCTGGCAGAGAATCTCAAAGTCCTATACCCGCACCACATGGCCCAGGTA 
|f| CACTGGAGTGGGGGTGATGTGCACTATCACCTGGAGAGCCATCCCCCGGGACCCTTTGAAGTGAA 
f II TGCAGAGGGAAACCTCTACGTGACCAGAGAGCTGGAC^,GAGAAGCCCAGGCTGAGTACCTGCTCC 
7fi AGGTGCGGGCTCAGAATTCCCATGGCGAGGACTATGCGGCCCCTCTGGAGCTGCACGTGCTGGTG 
)** ATGGATGAGAAT GACAACG T GCC TAT C T GCC CT CCCCGTGAC CCCACAGTCAGCAT CCC T GAGCT 

Hi CAGTCCACCAGGTACTGAAGTGACTAGACTGTCAGCAGAGGATGCAGATGCCCCCGGCTCCCCCA 
Si ATTCCCACGTTGTGTATCAGCTCCTGAGCCCTGAGCCTGAGGATGGGGTAGAGGGGAGAGCCTTC 
Q CAGGTGGACCCCACTTCAGGCAGTGTGACGCTGGGGGTGCTCCCACTCCGAGCAGGCCAGAACAT 
l$k CCTGCTTCTGGTGCTGGCCATGGACCTGGCAGGCGCAGAGGGTGGCTTCAGCAGCACGTGTGAAG 
Sf TCGAAGTCGCAGTCACAGATATCAATGATCACGCCCCTGAGTTCATCACTTCCCAGATTGGGCCT 
ATAAGCCTCCCTGAGGATGTGGAGCCCGGGACTCTGGTGGCCATGCTAACAGCCATTGATGCTGA 
|l| CC T CGAGC CCGCCT T CCGCC T CATGGAT TT TGCCAT TGAGAGGGGAGACACAGAAGGGAC T T T TG 

fjl GCC T GGAT TGGGAGCCAGAC TC T GGGCATGT TAGACT CAGACTC T GCAAGAACCT CAGT TAT GAG 

fSg GCAGCTCCAAGTCATGAGGTGGTGGTGGTGGTGCAGAGTGTGGCGAAGCTGGTGGGGCCAGGCCC 
,? * AGGCCCTGGAGCCACCGCCACGGTGACTGTGCTAGTGGAGAGAGTGATGCCACCCCCCAAGTTGG 
ACCAGGAGAGCTACGAGGCCAGTGTCCCCATCAGTGCCCCAGCCGGCTCTTTCCTGCTGACCATC 
CAGCCCTCCGACCCCATCAGCCGAACCCTCAGGTTCTCCCTAGTCAATGACTCAGAGGGCTGGCT 
CTGCATTGAGAAATTCTCCGGGGAGGTGCACACCGCCCAGTCCCTGCAGGGCGCCCAGCCTGGGG 
ACACCTACACGGTGCTTGTGGAGGCCCAGGATACAGCCCTGACTCTTGCCCCTGTGCCCTCCCAA 
TACCTCTGCACACCCCGCCAAGACCATGGCTTGATCGTGAGTGGACCCAGCAAGGACCCCGATCT 
GGCCAGTGGGCACGGTCCCTACAGCTTCACCCTTGGTCCCAACCCCACGGTGCAACGGGATTGGC 
GCCTCCAGACTCTCAATGGTTCCCATGCCTACCTCACCTTGGCCCTGCATTGGGTGGAGCCACGT 
GAACACATAATCCCCGTGGTGGTCAGCCACAATGCCCAGATGTGGCAGCTCCTGGTTCGAGTGAT 
CGTGTGTCGCTGCAACGTGGAGGGGCAGTGCATGCGCAAGGTGGGCCGCATGAAGGGCATGCCCA 
CGAAGCTGTCGGCAGTGGGCATCCTTGTAGGCACCCTGGTAGCAATAGGAATCTTCCTCATCCTC 
AT T T TCACCCAC T GGACCAT GTCAAGGAAGAAGGACCCGGAT CAACCAGCAGACAGCGT GCCC C T 
GAAGGCGACTGTCTGAATGGCCCAGGCAGCTCTAGCTGGGAGCTTGGCCTCTGGCTCCATCTGAG 
TCCCC T GGGAGAGAGC CCAGCAC CCAAGATCCAGCAGGGGACAGGACAGAGTAGAAGCCCCTCCA 
TCTGCCCTGGGGTGGAGGCACCATCACCATCACCAGGCATGTCTGCAGAGCCTGGACACCAACTT 
TATGGACTGCCCATGGGAGTGCTCCAAATGTCAGGGTGTTTGCCCAATAATAAAGCCCCAGAGAA 
CTGGGCTGGGCCCTATGGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAG 
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FIGURE q8 



MVPAWLWLLCVSVPQALPKAQPAELSVEVPENYGGNFPLYLTKLPLPREGAEGQIVLSGDSGKAT 
EGPFAMDPDSGFLLVTRALDREEQAEYQLQVTLEMQDGHVLWGPQPVLVHVKDENDQVPHFSQAI 
YRARLSRGTRPGI PFLFLEASDRDEPGTANSDLRFHI LSQAPAQPS PDMFQLEPRLGALALS PKG 
STSLDHALERTYQLLVQVKDMGDQASGHQATATVEVSIIESTWVSLEPIHLAENLKVLYPHHMAQ 
VHWSGGDVHYHLESHPPGPFEVNAEGNLYVTRELDREAQAEYLLQVRAQNSHGEDYAAPLELHVL 
VMDENDNVPICPPRDPTVSIPELSPPGTEVTRLSAEDADAPGSPNSHWYQLLSPEPEDGVEGRA 
FQVDPTSGSVTLGVLPLRAGQNILLLVLAMDLAGAEGGFSSTCEVEVAVTDINDHAPEFITSQIG 
PISLPEDVEPGTLVAMLTAIDADLEPAFRLMDFAIERGDTEGTFGLDWEPDSGHVRLRLCKNLSY 
EAAPSHEVVVWQSVAKLVGPGPGPGATATVTVLVERVMPPPKLDQES YEAS VPI SAPAGS FLLT 
IQPSDPISRTLRFSLVNDSEGWLCIEKFSGEVHTAQSLQGAQPGDTYTVLVEAQDTALTLAPVPS 
QYLCTPRQDHGLIVSGPSKDPDLASGHGPYSFTLGPNPTVQRDWRLQTLNGSHAYLTIALHWVEP 
REHI I PWVSHNAQMWQLLWVI VCRCNVEGQCMRKVGRMKGMPTKLSAVGI LVGTLVAI GI FL I 
LIFTHWTMSRKKDPDQPADSVPLKATV 



Signal peptide: 

amino acids 1-18 

Transmembrane domain: 

amino acids 7 62-784 
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FIGURE QQ 

GGCTGACGGTGCTACATTGCCTGGAGGAAGCCTAAGGAACCCAGGCATCCAGCTGCCCACGCCTG 
AGTCCAAGAT T CT TCCCAGGAACACAAACGTAGGAGACCCACGC TCC T GGAAGCACCAGCCT T TA 
TCTCTTCACCTTCAAGTCCCCTTTCTCAAGAATCCTCTGTTCTTTGCCCTCTAAAGTCTTGGTAC 
ATCTAGGACCCAGGCAT C T TGC T TTCCAGCCACAAAGAGACAGATGAAGAT GCAGAAAGGAAATG 
TTCTCCTTATGTTTGGTCTACTATTGCATTTAGAAGCTGCAACAAATTCCAATGAGACTAGCACC 

T C TGCCAACAC T GGAT CCAGTG T GATC TCCAGT GGAGC CAGCACAGC CACCAACT CTGGGTCCAG 
TGTGACCTCCAGTGGGGTCAGCACAGCCACCATCTCAGGGTCCAGCGTGACCTCCAATGGGGTCA 
GCATAGTCACCAACT C T GAGTTCCATACAACCTCCAGT GGGAT CAGCACAGCCACCAAC T C T GAG 
TTCAGCACAGCGT CCAGT GGGATCAGCATAGCCACCAACTCTGAGTCCAGCACAACCTCCAGTGG 
%,i GGCCAGCACAGCCACCAAC T CT GAGT CCAGCACAC CC TCCAGTGGGGCCAGCACAGTCACCAACT 

ft* CTGGGTCCAGTGTGACCTCCAGTGGAGCCAGCACTGCCACCAACTCTGAGTCCAGCACAGTGTCC 
lH AGTAGGGCCAGCACTGCCACCAACTCTGAGTCTAGCACACTCTCCAGTGGGGCCAGCACAGCCAC 
M CAAC TC TGAC TCCAGCACAACCT CCAGTGGGGC TAGCACAGCCACCAAC TC T GAGT CCAGCACAA 

CP*- CCTCCAGTGGGGCCAGCACAGCCACCAACTCTGAGTCCAGCACAGTGTCCAGTAGGGCCAGCACT 
||| GCCACCAACT CT GAGT CCAGCACAACC TCCAGT GGGGC CAGCACAGCCACCAACT CTGAGTCCAG 

li% AACGACCTCCAATGGGGCTGGCACAGCCACCAACTCTGAGTCCAGCACGACCTCCAGTGGGGCCA 
%l GCACAGCCACCAACTCTGACTCCAGCACAGTGTCCAGTGGGGCCAGCACTGCCACCAACTCTGAG 
TCCAGCACGACCT CCAGT GGGGCCAGCACAGCCACCAACTC T GAGTCCAGCACGACC T CCAGT GG 
111 GGCTAGCACAGCCACCAACTCTGACTCCAGCACAACCTCCAGTGGGGCCGGCACAGCCACCAACT 
21 C TGAGTCCAGC ACAGT GTCCAGTGGGATCAGCACAGT CACCAAT TC TGAGTCCAGCACACCCT CC 

m AGTGGGGCCAACACAGCCACCAACTCTGAGTCCAGTACGACCTCCAGTGGGGCCAACACAGCCAC 
j J CAAC TC TGAGTCCAGCACAGTGT CCAG TGGGGCCAGCACTGCCACCAAC TC T GAG TCCAGC ACAA 

%: CCTCCAGTGGGGTCAGCACAGCCACCAACTCTGAGTCCAGCACAACCTCCAGTGGGGCTAGCACA 
U GCCACCAAC T CT GACT CCAGCACAACC TCCAGT GAGGC CAGCACAGCCACCAACT CTGAG T C TAG 

i/i CACAGT GTCCAGTGGGATCAGCACAGT CACCAAT T C TGAGT CCAGCACAACC T CCAGT GGGGCCA 

f\ ACACAGCCACCAACTCT GGG T CCAGTGTGACCT C TGCAGGC T C TGGAACAGCAGC T C T GACT GGA 

sis ATGCACACAACTTCCCATAGTGCATCTACTGCAGTGAGTGAGGCAAAGCCTGGTGGGTCCCTGGT 
GCCGTGGGAAATCTTCCTCATCACCCTGGTCTCGGTTGTGGCGGCCGTGGGGCTCTTTGCTGGGC 
TCTTCTTCTGTGTGAGAAACAGCCTGTCCCTGAGAAACACCTTTAACACAGCTGTCTACCACCCT 
CATGGCCTCAACCATGGCCTTGGTCCAGGCCCTGGAGGGAATCATGGAGCCCCCCACAGGCCCAG 
GTGGAGTCCTAACTGGTTCTGGAGGAGACCAGTATCATCGATAGCCATGGAGATGAGCGGGAGGA 
ACAGCGGGCCC TGAG CAGCCCCGGAAGCAAGTGCCGCATTCTTCAGGAAGGAAGAGACCTGGGCA 
CCCAAGACCTGGTTTCCTTTCATTCATCCCAGGAGACCCCTCCCAGCTTTGTTTGAGATCCTGAA 
AAT C TTGAAGAAGG T AT TCCT CACC T TTCTTGCCTT TACCAGACAC TGGAAAGAGAATACTATAT 
T GCTCAT T TAGC TAAGAAATAAATACATC TCAT C TAACACACACGACAAAGAGAAGCT GT GC T T G 
CCCCGGGGTGGGTATCTAGCTCTGAGATGAACTCAGTTATAGGAGAAAACCTCCATGCTGGACTC 
CATC TGGCAT TCAAAAT C T C CACAGTAAAAT CCAAAGACCT CAA&A?AAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE lOQ 



MKMQKGNVLLMFGLLLHLEAATNSNETSTSANTGSSVISSGASTATNSGSSVTSSGVSTATISGS 
SVTSNGVSIVTNSEFHTTSSGISTATNSEFSTASSGISIATNSESSTTSSGASTATNSESSTPSS 
GASTVTNSGSSVTSSGASTATNSESSTVSSRASTATNSESSTLSSGASTATNSDSSTTSSGASTA 
TNSESSTTSSGASTATNSESSTVSSRASTATNSESSTTSSGASTATNSESRTTSNGAGTATNSES 
STTSSGASTATNSDSSTVSSGASTATNSESSTTSSGASTATNSESSTTSSGASTATNSDSSTTSS 
GAGTATNSESSTVSSGISTVTNSESSTPSSGANTATNSESSTTSSGANTATNSESSTVSSGASTA 
TNSESSTTSSGVSTATNSESSTTSSGASTATNSDSSTTSSEASTATNSESSTVSSGISTVTNSES 
STTSSGANTATNSGSSVTSAGSGTAALTGMHTTSHSASTAVSEAKPGGSLVPWEIFLITLVSWA 
AVGLFAGLFFCVRNSLSLRNTFNTAVYHPHGLNHGLGPGPGGNHGAPHRPRWSPNWFWRRPVSSI 

AMEMSGRNSGP 

Signal peptide: 

amino acids 1-20 

Transmembrane domain: 

amino acids 510-532 
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FIGURE 101 



GGCCGGACGCCTCCGCGTTACGGGATGAATTAACGGCGGGTTCCGCACGGAGGTTGTGACCCCTA 
CGGAGCCCCAGCTTGCCCACGCACCCCACTCGGCGTCGCGCGGCGTGCCCTGCTTGTCACAGGTG 
GGAGGCTGGAACTATCAGGCTGAAAAACAGAGTGGGTACTCTCTTCTGGGAAGCTGGCAACAAAT 
GGATGATGTGATATATGCATTCCAGGGGAAGGGAAATTGTGGTGCTTCTGAACCCATGGTCAATT 
AACGAGGCAGTT TCTAGCTACTGCACGTACT TCATAAAGCAGGACTCTAAAAGCT TTGGAATCAT 
GGTGTCATGGAAAGGGATTTACTTTATACTGACTCTGTTTTGGGGAAGCTTTTTTGGAAGCATTT 
TCATGCTGAGTCCCTTTTTACCTTTGATGTTTGTAAACCCATCTTGGTATCGCTGGATCAACAAC 
CGCCTTGTGGCAACATGGCTCACCCTACCTGTGGCATTATTGGAGACCATGTTTGGTGTAAAAGT 

GAT TATAACTGGGGATGCAT T TGT TCC T GGAGAAAGAAGTGT CAT TATCATGAACCATCGGACAA 
GAATGGACTGGATGTTCCTGTGGAATTGCCTGATGCGATATAGCTACCTCAGATTGGAGAAAATT 
TGCCTCAAAGCGAGTCTCAAAGGTGTTCCTGGATTTGGTTGGGCCATGCAGGCTGCTGCCTATAT 
CTTCATTCATAGGAAATGGAAGGATGACAAGAGCCATTTCGAAGACATGATTGATTACTTTTGTG 
ATATTCACGAACCACTT CAACTCCTCATAT TCCCAGAAGGGACTGATCTCACAGAAAACAGCAAG 
TCTCGAAGTAATGCATTTGCTGAAAAAAATGGACTTCAGAAATATGAATATGTTTTACATCCAAG 
AAC TACAGGCT TT ACTT T TGTGGTAGACCGTCTAAGAGAAGGTAAGAACC TT GATGCT GT CCAT G 
ATATCACTGTGGCGTATCCTCACAACATTCCTCAATCAGAGAAGCACCTCCTCCAAGGAGACTTT 
CCCAGGGAAAT CCACT TT CACGTCCACCGGTATCCAATAGACACCCTCCCCACAT CCAAGGAGGA 
CCTTCAACTCTGGTGCCACAAACGGTGGGAAGAGAAAGAAGAGAGGCTGCGTTCCTTCTATCAAG 

GGGAGAAGAAT T T T TAT T T T ACCGGAC AGAGT G T CAT T CCACCT TGCAAGT C T GAAC TCAGGGT C 
CTTGTGGTCAAATTGCTCTCTATACTGTATTGGACCCTGTTCAGCCCTGCAATGTGCCTACTCAT 
ATATTTGTACAGTCTTGTTAAGTGGTATTTTATAATCACCATTGTAATCTTTGTGCTGCAAGAGA 
GAATATTTGGTGGACTGGAGATCATAGAACTTGCATGTTACCGACTTTTACACAAACAGCCACAT 
T TAAAT TCAAAGAAAAAT GAG TAAGAT T AT AAGGT T T GCCAT G T GAAAACC TAGAGC AT AT T T T G 
GAAATGTTCTAAACCTTTCTAAGCTCAGATGCATTTTTGCATGACTATGTCGAATATTTCTTACT 
GCCATCAT TAT TT GT TAAAGATAT T TTGCAC T TAAT TT TGTGGGAAAAA.TAT T GCTACAAT TT TT 
TTTAATCTCTGAATGTAATTTCGATACTGTGTACATAGCAGGGAGTGATCGGGGTGAAATAACTT 

GGGCCAGAATATTATTAAACAATCATCAGGC TT T TAAA 



FIGURE 1Q2 



MHSRGREIWLLNPWSINEAVSSYCTYFIKQDSKSFGIMVSWKGIYFILTLFWGSFFGSIFMLSP 
FLPMFVNPSWYRWINNRLVATWLTLPVALLETMFGVKVIITGDAFVPGERSVIIMNHRTRMDTO 
FLWNCLMRYSYLRLEKICLKASLKGVPGFGWAMQAAAYIFIHRKWKDDKSHFEDMIDYFCDIHEP 
LQLLIFPEGTDLTENSKSRSNAFAEKNGLQKYEYVLHPRTTGFTFWDRLREGKNLDAVHDITVA 

YPHNI PQSEKHLLQGD FPRE IHFHVHRYP IDTLPTSKEDLQLWCHKRWEEKEERLRS FYQGEKNF 
YFTGQSVIPPCKSELRVLWKLLSILYWTLFSPAMCLLIYLYSLVKWYFIITIVIFVLQERIFGG 

LEIIELACYRLLHKQPHLNSKKNE 

Important features of the protein: 
Signal peptide: 

amino acids 1-22 

Transmembrane domains : 

amino acids 44-63, 90-108, 354-377 
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FIGURE 1Q3 

CGGCTCGAGCGGCTCGAGTGAAGAGCCTCTCCACGGCTCCTGCGCCTGAGACAGCTGGCCTGACC 
T C C AAAT CAT C CAT C C AC CCCTGCTGT CAT C T G T T T T CAT AG T G T G AGAT CAACC C AC AGG AAT A 
TCCATGGCTTTTGTGCTCATTTTGGTTCTCAGTTTCTACGAGCTGGTGTCAGGACAGTGGCAAGT 
CACTGGACCGGGCAAGTTTGTCCAGGCCTTGGTGGGGGAGGACGCCGTGTTCTCCTGCTCCCTCT 
TTCCTGAGACCAGTGCAGAGGCTATGGAAGTGCGGTTCTTCAGGAATCAGTTCCATGCTGTGGTC 
CACCTCTACAGAGATGGGGAAGACTGGGAATCTAAGCAGATGCCACAGTATCGAGGGAGAACTGA 
GTTTGTGAAGGACTCCATTGCAGGGGGGCGTGTCTCTCTAAGGCTAAAAAACATCACTCCCTCGG 
ACATCGGCCTGTATGGGTGCTGGTTCAGTTCCCAGATTTACGATGAGGAGGCCACCTGGGAGCTG 
CGGGTGGCAGCACTGGGCTCACTTCCTCTCATTTCCATCGTGGGATATGTTGACGGAGGTATCCA 
GTTACTCTGCCTGTCCTCAGGCTGGTTCCCCCAGCCCACAGCCAAGTGGAAAGGTCCACAAGGAC 
AGGAT T TG T C T T CAGACT CCAGAGCAAAT GCAGATGGGT ACAGCC TGTATGATGT GGAGATCT CC 
AT TATAGT CCAGGAAAATGCTGGGAGCATAT TGT GT T CCATCCACC T T GCT GAGCAGAGT CATGA 
GGTGGAATCCAAGGTATTGATAGGAGAGACGTTTTTCCAGCCCTCACCTTGGCGCCTGGCTTCTA 
TTTTACTCGGGTTACTCTGTGGTGCCCTGTGTGGTGTTGTCATGGGGATGATAATTGTTTTCTTC 
AAATCCAAAGGGAAAAT CCAGGCGGAACTGGACT GGAGAAGAAAGCACGGACAGGCAGAAT T GAG 
AGACGCCCGGAAACACGCAGTGGAGGTGACTCTGGATCCAGAGACGGCTCACCCGAAGCTCTGCG 
TTTCTGATCTGAAAACTGTAACCCATAGAAAAGCTCCCCAGGAGGTGCCTCACTCTGAGAAGAGA 
T TTACAAGGAAGAGTGTGGT GGCT TCTCAGGGT TTCCAAGCAGGGAGACAT TACT GGGAGGT GGA 
CGTGGGACAAAATGTAGGGTGGTATGTGGGAGT GT GT CGGGATGACGTAGACAGGGGGAAGAACA 
ATGTGACT TTGT CT CCCAACAAT GGGTATT GGGT CCT CAGACTGACAACAGAACAT T TGTAT T T C 
ACATTCAATCCCCATTTTATCAGCCTCCCCCCCAGCACCCCTCCTACACGAGTAGGGGTCTTCCT 
GGACTATGAGGGTGGGACCATCTCCTTCTTCAATACAAATGACCAGTCCCTTATTTATACCCTGC 
TGACATGTCAGTTTGAAGGCTTGTTGAGACCCTATATCCAGCATGCGATGTATGACGAGGAAAAG 
GGGACTCCCATATTCATATGTCCAGTGTCCTGGGGATGAGACAGAGAAGACCCTGCTTAAAGGGC 
CC CACACCACAGAC CCAGACACAGCCAAGGGAGAGTGCT CCCGACAGGTGGC CCCAGCT T CC T C T 
CCGGAGCCTGCGCACAGAGAGTCACGCCCCCCACTCTCCTTTAGGGAGCTGAGGTTCTTCTGCCC 
TGAGCCCTGCAGCAGCGGCAGTCACAGCTTCCAGATGAGGGGGGATTGGCCTGACCCTGTGGGAG 
TCAGAAGCCATGGCTGCCCTGAAGTGGGGACGGAATAGACTCACATTAGGTTTAGTTTGTGAAAA 
CTCCATCCAGCTAAGCGATCTTGAACAAGTCACAACCTCCCAGGCTCCTCATTTGCTAGTCACGG 
ACAGTGAT TCC T GCC T CACAGGT GAAGATTAAAGAGACAACGAAT GTGAATCAT GCT T GCAGGT T 
TGAGGGCACAGTGTTTGCTAATGATGTGTTTTTATATTATACATTTTCCCACCATAAACTCTGTT 
TGCTTAT TCCACATTAATTTACTT TTCTCTATACCAAAT CACCCATGGAATAGT TATTGAACACC 
TGCTT TGTGAGGCT CAAAGAATAAAGAGGAGGTAGGATT TT TCACTGAT TCTATAAGCCCAGCAT 
TAC C T GATACCAAAACCAGGCAAAGAAAACAGAAGAAGAGGAAGGAAAACTACAGGT CCATAT CC 
CTCAT TAACACAGACACAAAAAT T C TAAATAAAAT TT TAACAAAT TAAACTAAACAATATAT TTA 
AAGAT GAT ATATAACTACT CAGT GTGG T TTGTCCCACAAATGCAGAGT T GGT T TAAT AT TTAAAT 
AT CAACCAGT GT AAT T CAGCACAT TAATAAAGTAAAAAAGAAAACCATAAAAAAAAAAAAAAA 
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MAFVLILVLSFYELVSGQWQVTGPGKFVQALVGEDAVFSCSLFPETSAEAMEVRFFRNQFHAWH 
LYRDGEDWESKQMPQYRGRTEFVKDSIAGGRVSLRLKNITPSDIGLYGCWFSSQIYDEEATWELR 

VAALGSLPL I S I VGYVDGG I QLLCLS S GWFPQPT AKWKGPQGQDL S SDSRANADGYS LYDVE I S I 
IVQENAGSILCSIHIJVEQSHEVESKVLIGETFFQPSPWRLASILLGLLCGALCGVVMGMIIVFFK 
SKGKIQAELDWRRKHGQAELRDARKHAVEVTLDPETAHPKLCVSDLKTVTHRKAPQEVPHSEKRF 
TRKS WAS QGFQAGRHYWE VDVGQNVGWYVGVCRDDVDRGKNNVTL S PNNGYWVLRLTTEHLY FT 
FNPHFISLPPSTPPTRVGVFLDYEGGTISFFNTNDQSLIYTLLTCQFEGLLRPYIQHAMYDEEKG 

TPIFICPVSWG 

Signal peptide: 

amino acids 1-17 

Transmembrane domains: 

amino acids 131-150, 235-259 
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FIGURE 1Q5 



CCT T CACAGG AC TCTT C ATT GC TGGT TGGC AAjTCr AT GT AT CGGCCAGATGT GGT GAGGGCT AGGAAAAGAG 
TTTGTTGGGAACCCTGGGTTATCGGCCTCGTCATCTTCATATCCCTGATTGTCCTGGCAGTGTGCATTGGA 
CTCACTGTTCATTATGTGAGATATAATCAAAAGAAGACCTACAATTACTATAGCACATTGTCATTTACAAC 
TGACAAACT ATAT GCT GAGTTT GGC AGAGAGGCT TCT AAC AATT T T AC AGAAAT GAGCCAGAGACT TGAAT 
CAATGGTGAAAAATGCATTTTATAAATCTCCATTAAGGGAAGAATTTGTCAAGTCTCAGGTTATCAAGTTC 
AGTCAACAGAAGCATGGAGTGTTGGCTCATATGCTGTTGATTTGTAGATTTCACTCTACTGAGGATCCTGA 
AACTGTAGATAAAATTGTTCAACTTGTTTTACATGAAAAGCTGCAAGATGCTGTAGGACCCCCTAAAGTAG 
ATCCTCACTCAGTTAAAATTAAAAAAATCAACAAGACAGAAACAGACAGCTATCTAAACCATTGCTGCGGA 
ACACGAAGAAGTAAAACTCTAGGTCAGAGTCTCAGGATCGTTGGTGGGACAGAAGTAGAAGAGGGTGAATG 
GCCC TGGC AGGCTAGCCT GC AGTGGGAT GGGAGT CATCGCTGTGGAGC AACCTT AATT AATGCC ACATGGC 
TTGTGAGTGCTGCTCACTGTTTTACAACATATAAGAACCCTGCCAGATGGACTGCTTCCTTTGGAGTAACA 
AT AAAACC TT CG AAAATG AAAC GGGGT C TCCGGAGAAT AATT GTCC AT GAAAAAT AC AAACACCC ATC AC A 
TGAC T ATGAT AT TTCT CT TGCAGAGCT T TC T AGCCCTGTT CC CT AC AC AAAT GC AGT AC AT AGAGT T T GT C 
TCCCTGAT GCAT CCT AT GAGTT TC AACC AGGT GATGTGAT GT TTGT GACAGG AT TTGGAGCACT GAAAAAT 
GATGGT T ACAGT CAAAATCAT C TT CGAC AAGC AC AGGT GACTCTC AT AGACGCT AC AACT TGCAAT GAACC 
TC AAGCTT AC AAT GACGCCAT AACTCCT AGAATGTT AT GT GCTGGC TC CTT AGAAGGAAAAACAGATGCAT 
GCCAGGGTGACTCTGGAGGACCACTGGTTAGTTCAGATGCTAGAGATATCTGGTACCTTGCTGGAATAGTG 
AGCT GGGG AG ATGAAT GTGCGAAACCCAAC AAGCCT GGTGT T TAT ACT AG AGTT ACGGCC T TGCGGGACT G 
GATT ACT TCAAAAACT GGT ATC TAAGAGAC AAAAGCCT CAT GGAAC AGAT AAC ATT TT TT TTTGTTTTTTG 
GGTGTGGAGGCCAT TT TT AGAG AT AC AGAATT GG AG AAGACT T G CAAAACAGCT AGAT TT GACT GATCTC A 
ATAAACTGTTTGCTTGATGCATGTATTTTCTTCCCAGCTCTGTTCCGCACGTAAGCATCCTGCTTCTGCCA 
GATCAACTCTGTCATCTGTGAGCAATAGTTGAAACTTTATGTACATAGAGAAATAGATAATACAATATTAC 
ATT ACAGC CT GT AT TC ATTT GT TCT C T AGAAG TT TTGT C AGAAT TT TGACT T GTTGAC AT AAATTT GT AAT 
GCATATATACAATTTGAAGCACTCCTTTTCTTCAGTTCCTCAGCTCCTCTCATTTCAGCAAATATCCATTT 
T C AAGGTGCAGAAC AAGGAGTG AAAG AAAAT AT AAGAAGAAAAAAAT CCCCT AC ATTT TAT T GG CAC AGAA 
AAGT AT T AGGT GTT TT T CTT AGTGGAAT AT T AGAAATGAT CAT ATT CATT AT GAAAGGTC AAGC AAAGAC A 
GC AG AATACC AAT CACTTCATC ATTT AGGAAGT AT GGG AACT AAGT TAAG GAAGT C C AGAAAGAAGCCAAG 
AT AT ATCC TT ATT T TC ATTT CC AAAC AACT ACTATG AT AAAT GT GAAGAAGATT CT GT TT T T TT GT GACCT 
AT AAT AAT TATACAAAC TTCAT GC AATGT ACT TGTTCT AAG C AAAT TAAAGC AAAT AT T T AT TT AAC ATT G 
TTACTGAGGATGTCAACATATAACAATAAAATATAAATCACCCA 



FIGURE 106 



MMYRPDWRARKRVCWEPWVIGLVIFISLIVLAVCIGLTVHYVRYNQKKTYNYYSTLSFTTDKLY 
AEFGREASNNFTEMSQRLESMVKNAFYKSPLREEFVKSQVIKFSQQKHGVIJ^MLLICRFHSTED 
PETVDKIVQLVLHEKLQDAVGPPKVDPHSVKIKKINKTETDSYLNHCCGTRRSKTLGQSLRIVGG 
TEVEEGEWPWQASLQWDGSHRCGATLINATWLVSAAHCFTTYKNPARWTASFGVTIKPSKMKRGL 
RRIIVHEKYKHPSHDYDISLAELSSPVPYTNAVHRVCLPDASYEFQPGDVMFVTGFGALKNDGYS 
QNHLRQAQVTL I DAT T CNE PQAYNDAI TPRMLCAGS LE GKT DACQGDS GGPLVS S DARD I WYLAG 
IVSWGDECAKPNKPGVYTRVTALRDWITSKTGI 

Transmembrane domain: 

amino acids 21-40 (type II) 
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FIGURE 107 

AGAGAAAGAAGCGTCTCCAGCTGAAGCCAATGCAGCCCTCCGGCTCTCCGCGAAGAAGTTCCCTG 
CCCCGATGAGCCCCCGCCGTGCGTCCCCGACTATCCCCAGGCGGGCGTGGGGCACCGGGCCCAGC 
GCCGACGATCGCTGCCGTTTTGCCCTTGGGAGTAGGATGTGGTGAAAGGATGGGGCTTCTCCCTT 
ACGGGGCTCACAATGGCCAGAGAAGATTCCGTGAAGTGTCTGCGCTGCCTGCTCTACGCCCTCAA 
TCTGCTCTTTTGGTTAATGTCCATCAGTGTGTTGGCAGTTTCTGCTTGGATGAGGGACTACCTAA 
ATAATGT T C TCACT T TAAC T GCAGAAACGAGGGTAGAGGAAGCAGTCAT TT TGAC T TAC TTT CC T 
GTGGTTCATCCGGTCATGATTGCTGTTTGCTGTTTCCTTATCATTGTGGGGATGTTAGGATATTG 
TGGAACGGTGAAAAGAAATCTGTTGCTTCTTGCATGGTACTTTGGAAGTTTGCTTGTCATTTTCT 
GTGTAGAACTGGCTTGTGGCGTTTGGACATATGAACAGGAACTTATGGTTCCAGTACAATGGTCA 
GATATGGT CACT T T GAAAGC CAGGATGACAAAT T AT GGAT T ACCTAGATAT CGGT GGCT TACT CA 
TGCT T GGAAT TT T TTT CAGAGAGAGT T TAAGT GCTGTGGAGTAGTATAT TT CACT GACT GGT T GG 
AAATGACAGAGATGGACTGGCCCCCAGATTCCTGCTGTGTTAGAGAATTCCCAGGATGTTCCAAA 
CAGGCCCACCAGGAAGATCTCAGTGACCTTTATCAAGAGGGTTGTGGGAAGAAAATGTATTCCTT 
T T TGAGAGGAACCAAACAAC T GCAGGTGC T GAGGT TT CT GGGAATC TCCAT TGGGGTGACACAAA 
TCCTGGCCATGATTCTCACCATTACTCTGCTCTGGGCTCTGTATTATGATAGAAGGGAGCCTGGG 
ACAGACCAAATGATGTCCTTGAAGAATGACAACTCTCAGCACCTGTCATGTCCCTCAGTAGAACT 
GT T GAAACCAAGCCTGT CAAGAAT CT T TGAACACACAT CCATGGCAAACAGCT T TAAT ACACACT 
T TGAGATGGAGGAGTTATAAAAAGAAATGTCACAGAAGAAAACCACAAACT TGTTTTAT TGGACT 
T GTGAATT T T TGAGTACATAC TATGTGT TT CAGAAATATGTAGAAATAAAAAT GT TGCCAT AAAA 
TAACACC TAAGCATATACT AT TC TAT GC TT TAAAAT GAGGAT GGAAAAGT T TCATGT CATAAGTC 
ACCACCTGGACAATAAT TGAT GC CCT TAAAAT GCT GAAGACAGATGTCATACCCACT GT GTAGCC 
TGT GTAT GAC TTT TACT GAACACAGT TATGT TTT GAGGCAGCATGGTT T GAT TAGCAT T T CCGCA 
T CC AT GCAAACGAGT CACATATGGTGGGAC T GGAGCCAT AGT AAAGGT TGAT T TAC T TCT ACCAA 
C TAGTATATAAAGT ACTAAT TAAAT GC TAACAT AGGAAGTTAGAAAATAC TAAT AAC TTT TAT TA 
CTCAGCGATCTATTCTTCTGATGCTAAATAAATTATATATCAGAAAACTTTCAATATTGGTGACT 
ACCTAAATGTGATT TT TGCT GGT TACTAAAATATTCTTACCACTTAAAAGAGCAAGCTAACACAT 
TGTCTTAAGCTGATCAGGGATTTTTTGTATATAAGTCTGTGTTAAATCTGTATAATTCAGTCGAT 
TT CAGT T CT GATAAT GT TAAGAATAACCAT TATGAAAAGGAAAAT T T GT CCTGTATAGCAT CAT T 
AT T TT TAGCCT T T CCTGT T AATAAAGC T T TAC TAT T CT GTCCT GGGC T TATAT TACACAT ATAAC 
TGT TAT T TAAAT ACT TAACCAC TAAT T T TGAAAAT TAC CAGT GTGATACATAGGAAT CAT TATT C 
AGAATGTAGTCTGGTCTT TAGGAAGTATTAATAAGAAAATT T GCACATAACTTAGT TGATTCAGA 
AAGGAC T TGTAT GC TGT T TT TCTCCCAAATGAAGACT C T TT T TGACAC TAAACACT T TT TAAAAA 
GC T TAT C T T TGCCT TC T CCAAACAAGAAGCAATAGT CT CCAAGTCAATATAAAT TCT ACAGAAAA 
TAGTGTTCTTTTTCTCCAGAAAAATGCTTGTGAGAATCATTAAAACATGTGACAATTTAGAGATT 
CTTTGTTT TAT T T C AC T GAT TAAT AT AC T G T G G C AAAT T AC AC AGAT TAT TAAAT T T T T T T ACAA 
GAG TATAG TATAT T TAT T TGAAATGGGAAAAGTG CATT TT AC T GTAT T T T GTGTAT T TT GT T TAT 
T T C T CAGAATATGGAAAGAAAAT TAAAATGT GT CAATAAATAT TT TC TAGAGAGT AA 



FIGURE 108 



MAREDSWCLRCLLYALNLLFWmSISVIAVSAWMRDYLNNVLTLTAETRVEEAVILTYFPVVH^ 
VMIAVCCFLI IVGMLGYCGTVKRNLLLLAWYFGSLLVI FCVELACGVWTYEQELMVPVQWSDMVT 
LKARMTNYGLPRYRWLTHAWNFFQREFKCCGVVYFTDWLEMTEMDWPPDSCCVREFPGCSKQAHQ 
EDLSDLYQEGCGKKMYSFLRGTKQLQVLRFLGISIGVTQILAMILTITLLWALYYDRREPGTDQM 
MSLKNDNSQHLSCPSVELLKPSLSRIFEHTSMANS FNTHFEMEEL 

Signal peptide: 

amino acids 1-33 

Transmembrane domains: 

amino acids 12-35, 57-86, 94-114, 226-248 
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FIGURE 1QQ 



CCAAGGCCAGAGCTGTGGACACCTTATCCCACTCATCCTCATCCTCTTCCTCTGATAAAGCCCCTACCAGTGCT 
GAT AAAGTCTTTCTC GT GAGAGCCTAGAGGC CTT AAAAAAAAAAGTGCTT GAAAGAGAAGGGGAGAAAGGAAC A 
CCAGTATTAAGAGGATTTTCCAGTGTTTCTGGCAGTTGGTCCAGAAGGAT6CCTCCATTCCTGCTTCTCACCTG 
CCTCTTCATCACAGGCACCTCCGTGTCACCCGTGGCCCTAGATCCTTGTTCTGCTTACATCAGCCTGAATGAGC 
CCT GGAGGAAC ACT GACC AC C AGTT GG AT GAGT CT C AAG GTCCTCCTCTATGT GAC AACCATGT GAAT GGGGAG 
TGGTACCACTTCACGGGCATGGCGGGAGATGCCATGCCTACCTTCTGCATACCAGAAAACCACTGTGGAACCCA 
CGCACCTGTCTGGCTCAATGGCAGCCACCCCCTAGAAGGCGACGGCATTGTGCAACGCCAGGCTTGTGCCAGCT 
H TCAATGGGAACTGCTGTCTCTGGAACACCACGGTGGAAGTCAAGGCTTGCCCTGGAGGCTACTATGTGTATCGT 
Ul CTGACCAAGCCCAGC GTCTGCTTCCACGTCT ACTGT GGTCATTTTTAT GACATCTGCGAC GAGGACTGCCATGG 

2f C AGCT GCTC AG AT AC C AGC GAGT GC AC AT GC GC TCC AGGAACT GT GC T AGGC CCT GAC AGGC AGAC AT GCT T T G 

p! ATGAAAATGAATGTGAGCAAAACAACGGTGGCTGCAGTGAGATCTGT GTGAACCTCAAAAACTCCTACCGCTGT 

III 



III 



GAGT GT GGGGTT GGCCGT GT GCT AAGAAGTGATGGCAAGACTT GTGAAGACGTT GAAGGAT GCCACAAT AAC AA 
TGGTGGCTGCAGCCACTCTTGCCTTGGATCTGAGAAAGGCTACCAGTGTGAATGTCCCCGGGGCCTGGTGCTGT 
CT GAGGATAACCACACTT GCC AAGTCCCT GT GTT GT GCAAATCAAAT GCC ATT GAAGTGAACATCCCC AGGGAG 
CTGGTTGGTGGCCTGGAGCTCTTCCTGACCAACACCTCCTGCCGAGGAGTGTCCAACGGCACCCATGTCAACAT 
CCTCTTCTCTCTCAAGACATGTGGTACAGTGGTCGATGTGGTGAATGACAAGATTGTGGCCAGCAACCTCGTGA 
CAGGTCTACCCAAGCAGACCCCGGGGAGCAGCGGGGACTTCATCATCCGAACCAGCAAGCTGCTGATCCCGGTG 
ACCTGCGAGTTTCCACGCCTGTACACCATTTCTGAAGGATACGTTCCCAACCTTCGAAACTCCCCACTGGAAAT 
Ul CATGAGCCGAAATCATGGGATCTTCCCATTCACTCTGGAGATCTTCAAGGACAATGAGTTTGAAGAGCCTTACC 
GGGAAGCTCTGCCCACCCTCAAGCTTCGTGACTCCCTCTACTTTGGCATTGAGCCCGTGGTGCACGTGAGCGGC 
f|| TTGGAAAGCTTGGTGGAGAGCTGCTTTGCCACCCCCACCTCCAAGATCGACGAGGTCCTGAAATACTACCTCAT 
C C GGGAT G GCT GT GT T TCAGATGAC T C GGT AAAGC AGT AC ACAT C C C GGGAT C ACC T AGC AAAGC ACT T C C AGG 
TCCCTGTCTT CAAGT T T GT GG GC AAAG AC CACAAGGAAGT GT T T C T GC ACT GCC GGGT T CT T GT CT GT GGAGT G 
TT GGAC GAGC GTT CC C GC T GT GC CC AGGGTT GCC AC C GGC GAAT GC GT C GT GGGGC AGGAGGAG AGGAC T C AGC 
CGGTCTACAGGGCCAGACGCTAACAGGCGGCCCGATCCGCATCGACTGGGAGGACTAGTTCGTAGCCATACCTC 
GAGT CCCTGC ATT GGACGGCTCTGCTCTTTGGAGCTTCTCCCCCCACCGCCCTCTAAGAACATCTGCCAACAGC 
TGGGTTCAGACTTCACACTGTGAGTTCAGACTCCCAGCACCAACTCACTCTGATTCTGGTCCATTCAGTGGGCA 
CAGGTCACAGCACTGCTGAACAATGTGGCCTGGGTGGGGTTTCATCTTTCTAGGGTTGAAAACTAAACTGTCCA 
CCCAGAAAGACACTCACCCCATTTCCCTCATTTCTTTCCTACACTTAAATACCTCGTGTATGGTGCAATCAGAC 
C AC AAAAT C AG AAGC T GGGT AT AAT AT T T CAAGT T AC AAACCC T AGAAAAAT T AAAC AGT T ACT GAAATT AT GA 
CTTAAATACCCAATGACTCCTTAAATATGTAAATTATAGTTATACCTTGAAATTTCAATTCAAATGCAGACTAA 
TT AT AGGGAATTT GGAAGT GT ATCAAT AAAAC AGT ATAT AATTTT 
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FIGURE HO 



MPPFLLLTCLFITGTSVSPVALDPCSAYISLNEPWRNTDHQLDESQGPPLCDNHVNGEWYHFTGMAGDAMP 
TFCIPENHCGTHAPVWLNGSHPLEGDGIVQRQACAS FNGNCCLWNTTVEVKACPGGYYVYRLTKPSVCFHV 
YCGHFYDICDEDCHGSCSDTSECTCAPGTVLGPDRQTCFDENECEQNNGGCSEICVNLKNSYRCECGVGRV 
LRSDGKTCEDVEGCHNNNGGCSHSCLGSEKGYQCECPRGLVLSEDNHTCQVPVLCKSNAIEVNIPRELVGG 
LELFLTNTSCRGVSNGTHVNILFSLKTCGTWDWNDKIVASNLVTGLPKQTPGSSGDFIIRTSKLLIPVT 
CEFPRLYTISEGYVPNLRNSPLEIMSRNHGIFPFTLEIFKDNEFEEPYREALPTLKLRDSLYFGIEPWHV 
SGLESLVESCFATPTSKIDEVLKYYLIRDGCVSDDSVKQYTSRDHLAKHFQVPVFKFVGKDHKEVFLHCRV 
LVCGVLDERS RCAQGCHRRMRRGAGGEDSAGLQGQTLTGGPI RI DWED 

Important features of the protein: 
Signal peptide: 

amino acids 1-16 

N-glycosylation sites. 

amino acids 89-93, 116-120, 259-263, 291-295, 299-303 

Tyrosine kinase phosphorylation sites. 

amino acids 411-418, 443-451 

N-myristoylation sites. 

amino acids 226-232, 233-239, 240-246, 252-258, 296-302, 300-306, 
522-528, 531-537 

Aspartic acid and asparagine hydroxylation site. 

amino acids 197-209 

ZP domain proteins. 

amino acids 431-457 

Calcium-binding EGF-like proteins. 

amino acids 191-212, 232-253 
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FIGURE ill 

GAGAGAGG CAGCAGCT T GCT CAGC GGAC AAGGATGC TG GGCGTG AGGG ACCAAGGCCTGC CCTGCACT CGG 
GCCT CCT C CAGCCAGT GC TGACCAGGGACT TC TGACCT GCT GGC C AGCCAGGACCT GTGT GGGGAGGC CCT 
CCT GCT GCCT TGGGGT GACAATCT CAGC TC C AGGCT AC AGGG AGACCGGGAGGATC AC AGAGCCAGCATGT 
TACAGGATCCTGACAGTGATCAACCTCTGAACAGCCTCGATGTCAAACCCCTGCGCAAACCCCGTATCCCC 
ATGGAGACCTTCAGAAAGGTGGGGATCCCCATCATCATAGCACTACTGAGCCTGGCGAGTATCATCATTGT 
GGTTGTCCTCATCAAGGTGATTCTGGATAAATACTACTTCCTCTGCGGGCAGCCTCTCCACTTCATCCCGA 
GG AAGCAGCT GT GTGACGGAGAGCT GGACT GTCCCT TGGGGGAGGACGAGGAGC AC T GTGTC AAGAGCT TC 
CCCG AAGGGCCT GC AG T GGC AGTC CGCC TCTCCAAGGACCGATCC ACACTGC AGGT GCT GGACT CGGC C AC 
AGGG AACTGGTT CTCT GC CTGTTT CG AC AACT TC AC AGAAGC TCTCGC TG AG ACAGCCTGT AGGC AGAT GG 
GCTACAGCAGAGCTGTGGAGATTGGCCCAGACCAGGATCTGGATGTTGTTGAAATCACAGAAAACAGCCAG 
GAGCTTCGCATGCGGAACTCAAGTGGGCCCTGTCTCTCAGGCTCCCTGGTCTCCCTGCACTGTCTTGCCTG 
TGGGAAGAGCCTGAAGACCCCCCGTGTGGTGGGTGGGGAGGAGGCCTCTGTGGATTCTTGGCCTTGGCAGG 
T C AG CATC C AGT ACGACAAAC AGC ACGT CT GTGG AGGGAG CATCCT GGACCCCC AC T GGGTC CT C ACGGC A 
II GCCC AC T GCT TC AGGAAAC AT ACCGATGTGTT C AACTGGAAGGT GC GGGC AGGCTC AGAC AAACT GGG C AG 

CTTCCC AT CCCT GGCT GT GGCC AAGAT C ATCATC AT TGAAT TCAACCC CATGT ACC CC AAAGACAATG AC A 
TCGCCCTCATGAAGCTGCAGTTCCCACTCACTTTCTCAGGCACAGTCAGGCCCATCTGTCTGCCCTTCTTT 
GATGAGGAGCTC ACT C C AGCCACCCC AC TCT GGATC ATTGGATGGGGC TT T ACGAAGC AG AAT GGAGGGAA 
GAT GTCT GAC AT ACTGCT GCAG GC GTCAGTCC AGGT C ATT GACAGC AC ACGGTGCAAT GC AGACGATGCGT 
AC C AGGGGGAAGT CAC CG AGAAGATGAT GT GT GC AGGC AT CCCGGAAGGG GGTGT GGACACCTGCC AGGGT 
GACAGTGGTGGGCCCCTGATGTACCAATCTGACCAGTGGCATGTGGTGGGCATCGTTAGCTGGGGCTATGG 
CT GC GG GGGCCCGAGCAC CCC AGGAGT ATAC ACCAAGGTCTC AGC CT ATCT C AACT GGAT CT AC AATGT CT 
GGAAGGCTGAGCTG TAA TGCTGCTGCCCCTTTGCAGTGCTGGGAGCCGCTTCCTTCCTGCCCTGCCCACCT 
GGGG AT CC CCCAAAGT CAGAC AC AGAGC AAGAGT CCCCT T GGGT AC AC CCCT CT GC CC AC AGCCTC AGCAT 
TT CT TGGAGCAGC AAAGGGCCT C AATT C CT GT AAGAGACC CT CGCAGC CC AGAGGC GC CCAGAGGAAGT C A 
GCAGCCCTAGCTCGGCCACACTTGGTGCTCCCAGCATCCCAGGGAGAGACACAGCCCACTGAACAAGGTCT 
CAGGGGT ATT GCT AAGCC AAGAAGGAAC TTT C CC ACACT ACT GAAT GG AAGC AGGCTGT CT TGT AAAAGCC 
CAGATCACTGTGGGCTGGAGAGGAGAAGGAAAGGGTCTGCGCCAGCCCTGTCCGTCTTCACCCATCCCCAA 
GC CT ACT AGAGC AAGAAAC C AGTT GT AATATAAAAT GCACTGCCCT AC TGTT GGT ATG ACT ACCGT T ACCT 
ACTGTTGTCATTGTTATTACAGCTATGGCCACTATTATTAAAGAGCTGTGTAACATCTCTGGCAAAAAAAA 
AAAA 
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FIGURE 112 



MLQDPDSDQPLNSLDVKPLRKPRIPMETFRKVGIPIIIALLSLASIIIVWLIKVILDKYYFLCG 
QPLHFIPRKQLCDGELDCPLGEDEEHCVKSFPEGPAVAVRLSKDRSTLQVLDSATGNWFSACFDN 
FTEALAETACRQMGYSRAVE IGPDQDLDWE ITENSQELRMRNS SGPCLSGSLVSLHCLACGKSL 
KTPRWGGEEASVDSWPWQVSIQYDKQHVCGGSILDPHWVLTAAHCFRKHTDVFNWKVRAGSDKL 
GS FP S LAVAKI 1 1 IE FNPMYPKDNDI ALMKLQFPLT FSGTVRP I CL P FFDEELTPAT PLW 1 1 GWG 
FTKQNGGKMS D I LLQAS VQVI DS TRCNADDAYQGEVTEKMMCAG I PEGGVDTCQGDS GGPLMYQS 
DQWHWGIVSWGYGCGGPSTPGVYTKVSAYLNWIYNVWKAEL 

Transmembrane domain: 

amino acids 32-53 (typell) 
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FIGURE lift 

GGCTGGACTGGAACTCCTGGTCCCAAGTGATCCACCCGCCTCAGCCTCCCAAGGTGCTGTGATTA 
TAGGTGTAAGCCACCGTGTCTGGCC TC TGAACAACT T T T T CAGCAAC TAAftAAAGCCACAGGAGT 
TGAACTGCTAGGATTCTGACTATGCTGTGGTGGCTAGTGCTCCTACTCGTACCTACATTAAAATC 
TGTTTTTTGTTCTCTTGTAACTAGCCTTTACCTTCCTAACACAGAGGATCTGTCACTGTGGCTCT 

GGCCCAAACC T GACCT T CAC TCT GGAAC GAGAACAGAGGT TT C T ACCCACACCGT CCCCT CGAAG 
CCGGGGACAGCCTCACCTTGCTGGCCTCTCGCTGGAGCAGTGCCCTCACCAACTGTCTCACGTCT 
GGAGGCACTGACTCGGGCAGTGCAGGTAGCTGAGCCTCTTGGTAGCTGCGGCTTTCAAGGTGGGC 
C T TGCCCTGGCCG TAGAAGGGAT TGACAAGCCCGAAGAT T T CATAGGCGATGGCT CCCACTGC CC 
AGGCATCAGCCTTGCTGTAGTCAATCACTGCCCTGGGGCCAGGACGGGCGGTGGACACCTGCTCA 

GAAGCAGT GGGTGAGACAT CACGC TGCCCGCCCATCTAACC T TT T CATGTCC T GCACAT CACCTG 
W ATCCATGGGCTAATCTGAACTCTGTCCCAAGGAACCCAGAGCTTGAGTGAGCTGTGGCTCAGACC 
% CAGAAGGGGTCTGCTTAGACCACCTGGTTTATGTGACAGGACTTGCATTCTCCTGGAACATGAGG 
f|J GAACGCCGGAGGAAAGCAAAGTGGCAGGGAAGGAACTTGTGCCAAATTATGGGTCAGAAAAGATG 
* GAGGTGTTGGGTTATCACAAGGCATCGAGTCTCCTGCATTCAGTGGACATGTGGGGGAAGGGCTG 
Q CCGATGGCGCATGACACACTCGGGACTCACCTCTGGGGCCATCAGACAGCCGTTTCCGCCCCGAT 
J;JJ CCACGTACCAGCTGCTGAAGGGCAACTGCAGGCCGATGCTCTCATCAGCCAGGCAGCAGCCAAAA 
H TCTGCGATCACCAGCCAGGGGCAGCCGTCTGGGAAGGAGCAAGCAAAGTGACCATTTCTCCTCCC 
Q CT CC T TC CC T CT GAGAGGC CCTCG TATGTCCCT AC TAAAGCCACCAGCAAGACATAGCTGACAGG 

11 GGCTAATGGCTCAGTGTTGGCCCAGGAGGTCAGCAAGGCCTGAGAGCTGATCAGAAGGGCCTGCT 
GTGCGAACACGGAAATGCCTCCAGTAAGCACAGGCTGCAAAATCCCCAGGCAAAGGACTGTGTGG 
C T CAAT T T AAAT CAT G T T CT AGTAAT T GGAGCT GT CCCCAAGACC AAAGGAGC T AGAGCT T GGT T 
CAAATGAT C T CCAAGGGCCCT TATACC CCAGGAGACTT T GAT T TGAAT T TGAAACCCCAAATCCA 
AACCTAAGAA.CCAGGTGCATTAAGAATCAGTTATTGCCGGGTGTGGTGGCCTGTAATGCCAA.CAT 
TT TGGGAGGCCGAGGCGGGTAGATCACC TGAGGTCAGGAGTT CAAGACCAGCCTGGCCAACAT GG 
TGAAACCCCTGTCTCTACTAAA^TACAAAAAAACTAGCCAGGCATGGTGGTGTGTGCCTGTATC 
CCAGC TAC T CGGGAGGCTGAGACAGGAGAAT T ACT T GAACCT GGGAGGT GAAGGAGGCT GAGACA 
GG AGAAT CAC T T C AG C C T GAG C AACAC AGC GAGAC T C T G T C T C AGAAAAAAT AAAAAAAG AAT T A 
TGGTTATTTGTAA 
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FIGURE 114 

MLWWLVLLLLPTLKSVFCSLVTSLYLPNTEDLSLWLWPKPDLHSGTRTEVSTHTVPSKPGTASPC 
WPLAGAVPSPTVSRLEALTRAVQVAEPLGSCGFQGGPCPGRRRD 



Signal peptide: 

amino acids 1-15 



M 
O 
0 
m 
ill 
Pi 
m 
ill 

Cl 
-M 
13 
W 

ni 
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FIGURE lis 

CAGCAGT GG TC T CTCAGTCCTC TCAAAGGAAGGAAAGAGTACT GTGT GC TGAGAGACCATCGCAA 
AGAAT CC TCCAGAGAATT GT GAAGAC TGTCACAT TCTAAAT GCAGAAGC TT T TAAATCCAAGAAA 
ATATGTAAATCACTTAAGATTTGTGGACTGGTGTTTGGTATCCTGGCCCT7UVCTCTAATTGTCCT 
GTTTTGGGGGAGCAAGCACTTCTGGCCGGAGGTACCCAAAAAAGCCTATGACATGGAGCACACTT 
T C TACAGCAATGGAGAGAAGAAGAAGATT TACAT GGAAAT TGAT CC TGT GACCAGAACT GAAATA 
T T CAGAAGCGGAAAT GGCACT GAT GAAACAT TGGAAG T GCACGAC T T TAAAAACGGAT ACAC TGG 
CATCTACTTCGTGGGTCTTCAAAAATGTTTTATCAAAACTCAGATTAAAGTGATTCCTGAATTTT 
CTGAACCAGAAGAGGAAATAGAT GAGAAT GAAGAAAT TACCACAACT T TC T T TGAACAGTCAGTG 
AT TT GGGT CCCAGCAGAAAAGCC TAT T GAAAACC GAGAT T T TC T TAAAAAT TCCAAAAT TCT GGA 
GAT T TGTGATAACGT GACCATGTATTGGAT CAAT CCCACTC TAATATCAGT T TCTGAGT TACAAG 
AC T T TGAGGAGGAGGGAGAAGATC TTCAC T T T CCT GC CAACGAAAAAAAAGGGAT T GAACAAAAT 
GAACAGTGGG T GGT CCC TCAAGTGAAAGTAGAGAAGACCCGT CACGCCAGACAAGCAAGTGAGGA 
AGAAC T TCCAAT AAAT GACTATAC TGAAAATGGAAT AGAAT T TGAT CCCAT GC TGGAT GAGAGAG 
GTTATTGTTGTATTTACTGCCGTCGAGGCAACCGCTATTGCCGCCGCGTCTGTGAACCTTTACTA 
GGCTAC TACCCATAT CCATAC T GCTACCAAGGAGGACGAGTCAT C TGT CGTGT CAT CATGCC T TG 
TAACTGGTGGGTGGCCCGCATGCTGGGGAGGGTCTAATAGGAGGTTTGAGCTCAAATGCTTAAAC 
TGCT GGCAACATATAATAAAT GCATGC TAT T CAATGAATTT CTGCC TAT GAGGCATCT GGCCCCT 
GGTAGCCAGCT C T CCAGAAT T AC T TG TAGGTAAT TCCT CTC T TCAT GT T CTAATAAACT TCTACA 
T TAT CACCAAAAAAAAAAAAAAAAAAA 
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FIGURE 116 



MAKNPPENCEDCHILNAEAFKSKKICKSLKICGLVFGILALTLIVLFWGSKHFWPEVPKKAYDME 
HTFYSNGEKKKIYMEIDPVTRTEIFRSGNGTDETLEVHDFKNGYTGIYFVGLQKCFIKTQIKVIP 
EFSEPEEEIDENEEITTTFFEQSVIWVPAEKPIENRDFLKNSKILEICDNVTMYWINPTLISVSE 
LQDFEEEGEDLHFPANEKKGIEQNEQWWPQVKVEKTRHARQASEEELPINDYTENGIEFDPMLD 
ERG YCC I YCRRGNRYCRRVCEPLLGY YP YP YC YQGGRVI CRVIMPCNWWVARMLGRV 

Important features of the protein: 
Signal peptide: 

amino acids 1-40 

Transmembrane domain: 

amino acids 25-47 (type II) 

N-glycosylation sites, 
amino acids 94-97, 180-183 

Glycosaminoglycan attachment sites. 

amino acids 92-95, 70-73, 85-88, 133-136, 148-151, 192-195, 239- 
242 

N-myristoylation sites. 

amino acids 33-38, 95-100, 116-121, 215-220, 272-277 

Microbodies C-terminal targeting signal. 

amino acids 315-317 

Cytochrome c family heme-binding site signature. 

amino acids 9-14 
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FIGURE 117 



GAGCTCCCCTCAGGAGCGCGTTAGCTTCACACCTTCGGCAGCAGGAGGGCGGCAGCTTCTCGCAGGCGGCA 
GGGCGGGCGGCCAGGATCATGTCCACCACCACATGCCAAGTGGTGGCGTTCCTCCTGTCCATCCTGGGGCT 
GGCC GGCTGC AT CGCGGCCACC GG GAT GGACATGTGGAGCACCC AGGACCT GT ACGAC AACCCCGT CACCT 
CCGTGTTCCAGTACGAAGGGCTCTGGAGGAGCTGCGTGAGGCAGAGTTCAGGCTTCACCGAATGCAGGCCC 
TATTTCACCATCCTGGGACTTCCAGCCATGCTGCAGGCAGTGCGAGCCCTGATGATCGTAGGCATCGTCCT 
GGGTGCCATTGGCCTCCTGGTATCCATCTTTGCCCTGAAATGCATCCGCATTGGCAGCATGGAGGACTCTG 
CCAAAGCCAACATGACACTGACCTCCGGGATCATGTTCATTGTCTCAGGTCTTTGTGCAATTGCTGGAGTG 
TCTGTGTTTGCCAACATGCTGGTGACTAACTTCTGGATGTCCACAGCTAACATGTACACCGGCATGGGTGG 
GATGGTGCAGACTGTTCAGACCAGGTACACATTTGGTGCGGCTCTGTTCGTGGGCTGGGTCGCTGGAGGCC 
TCACACTAATTGGGGGTGTGATGATGTGCATCGCCTGCCGGGGCCTGGCACCAGAAGAAACCAACTACAAA 
GCCGT T TCTT AT CATGCCTC AGGC CACAGT GTTGCCT AC AAGCCTGGAGGCT TCAAGGCC AGCACT GGCTT 
TGGGTCCAACACC AAAAACAAGAAGAT ATACGATGGAGGT GC CCGC AC AGAGGACGAGGT ACAAT CTT ATC 
CTTCCAAGCACGACTATGTGTAATGCTCTAAGACCTCTCAGCACGGGCGGAAGAAACTCCCGGAGAGCTCA 
CCCAAAAAAC AAGG AGAT CCCATC TAGATT T CTT CT TGCT TT T G AC TC ACAGCT GGAAGTT AGAAAAG CCT 
CGATTTCATCTTTGGAGAGGCCAAATGGTCTTAGCCTCAGTCTCTGTCTCTAAATATTCCACCATAAAACA 
GCTGAGTTATTTATGAATTAGAGGCTATAGCTCACATTTTCAATCCTCTATTTCTTTTTTTAAATATAACT 
TT CT ACT CTGAT GAGAGAAT GT GGTTTT AATCTCTCT CTCAC ATTT TGATGATT TAGACAGACTCC CCCT C 
TT CC TCCT AGTC AATAAACCCAT T GAT G AT CT ATTT CCCAGC TT AT CCCC AAGAAAACTT T TGAAAGGAAA 
GAGTAGACCCAAAGATGTTATTTTCTGCTGTTTGAATTTTGTCTCCCCACCCCCAACTTGGCTAGTAATAA 
ACACTTACTGAAGAAGAAGCAATAAGAGAAAGATATTTGTAATCTCTCCAGCCCATGATCTCGGTTTTCTT 
ACACTGT GAT CT T AAAAGTT ACCAAACCAAAGT C AT TT TC AGTTTGAGGC AACCAAACCT T TCT ACTGCT G 
TTGACATCTTCTTATTACAGCAACACCATTCTAGGAGTTTCCTGAGCTCTCCACTGGAGTCCTCTTTCTGT 
CGCGGGT C AGAAAT TGT CCC T AGATG AATGAG AAAAT T AT t T T T T T TAAT TT AAGT CCTAAAT ATAGTTAA 
AAT AAAT AAT GT TT T AGT AAAATGAT AC ACT ATCT C TGTGAAAT AGCC TC ACCCCT AC AT GTGG ATAGAAG 
GAAATGAAAAAATAATTGCTTTGACATTGTCTATATGGTACTTTGTAAAGTCATGCTTAAGTACAAATTCC 
AT GAAAAGCT C AC ACCT GT AAT CCT AGC ACTT TGGG AG GCT GAGGAGG AAGGAT CACTT GAGCCC AGAAGT 
TCGAGACT AGCCT G GGCAAC AT GGAGAAGCCC TGTCT C TACAAAAT AC AG AG AGAAAAAAT C AGCC AGT C A 
TGGTGGCATACACCTGTAGTCCCAGCATTCCGGGAGGCTGAGGTGGGAGGATCACTTGAGCCCAGGGAGGT 
TGGGGCT GCAGTG AGCCATG AT CACACC ACT GCACT CC AGCC AGGT GAG AT AGCGAGATCCT GT CT AA&AA 
AATAAAAAATAAATAATGGAACACAGCAAGTCCTAGGAAGTAGGTTAAAACTAATTCTTTAA 



10063592 



Page 267 



FIGURE 118 



MSTTTCQWAFLLSILGLAGCIAATGMDMWSTQDLYDNPVTSVFQYEGLWRSCVRQSSGFTECRP 
YFTILGLPAMLQAVRAIMIVGIVLGAIG 

CAI AGVSVFANMLVTNFWMSTANMYTGMGGMVQTVQTRYT FGAALFVGWVAGGLTLI GGVMMCIA 
CRGLAPEETNYKAVSYHASGHSVAYKPGGFKASTGFGSNTKNKKIYDGGARTEDEVQSYPSKHDY 

V 

Signal peptide: 

amino acids 1-23 

Transmembrane domains: 

amino acids 81-100, 121-141, 173-194 



APP ID=10063592 



Page 268 



FIGURE HQ 

GGAAAAACTGTTCTCTTCTGTGGCACAGAGAACCCTGCTTCAAAGCAGAAGTAGCAGTTCCGGAGTCC 
AGCTGGCTAAAACTCATCCCAGAGGATAATGGCAACCCATGCCTTAGAAATCGCTGGGCTGTTTCTTG 
GTGGTGTTGGAATGGTGGGCACAGTGGCTGTCACTGTCATGCCTCAGTGGAGAGTGTCGGCCTTCATT 
GAAAACAACATCGTGGTTTTTGAAAACTTCTGGGAAGGACTGTGGATGAATTGCGTGAGGCAGGCTAA 
CATCAGGATGCAGTGCAAAATCTATGATTCCCTGCTGGCTCTTTCTCCGGACCTACAGGCAGCCAGAG 
GACTGATGTGTGCTGCTTCCGTGATGTCCTTCTTGGCTTTCATGATGGCCATCCTTGGCATGAAATGC 
ACCAGGT GCACGGGGGACAAT GAGAAGGTGAAGGCT CACAT TCTGCT GACGGCTGGAATCATCT TCAT 
JJ" CATCACGGGCATGGTGGTGCTCATCCCTGTGAGCTGGGTTGCCAATGCCATCATCAGAGATTTCTATA 
ACTCAATAGTGAATGTTGCCCAAAAACGTGAGCTTGGAGAAGCTCTCTACTTAGGATGGACCACGGCA 
m CTGGTGCTGATTGTTGGAGGAGCTCTGTTCTGCTGCGTTTTTTGTTGCAACGAAAAGAGCAGTAGCTA 
l|| C AGATAC T C GAT ACC T T CC CATCG CACAACC CAAAAAAGT T AT CACAC C GGAAAGAAGT GACC G AGC G 

111 TCTACTCCAGAAGTCAGTATGTGTAGT T GTGTATGTT TTT T TAACTT TACTATAAAGCCAT GCAAAT G 

• ACAAAAAT C TAT AT T AC TT TC TCAAAAT GGAC C CCAAAGAAAC TTT GAT TTAC T GT T C T TAAC T GCC T 

ni 

* AATCTTAATTACAGGAACT GT GCAT CAGCTATT TATGATT CTATAAGCTAT TTCAGCAGAATGAGATA 

TTAAACCCAATGCTTTGATTGTTCTAGAAAGTATAGTAATTTGTTTTCTAAGGTGGTTCAAGCATCTA 
|'r| CTCTTTTTATCATTTACTTCAAAATGACATTGCTAAAGACTGCATTATTTTACTACTGTAATTTCTCC 
PI ACGACATAGCAT TAT GTACATAGATGAGT GTAACATT TATATCT CACAT AGAGACATGCTTATATGGT 

||| TTTATTTAAAATGAAATGCCAGTCCATTACACTGAATAAATAGAACTCAACTATTGCTTTTCAGGGAA 
|?l ATCATGGATAGGGTTGAAGAAGGTTACTATTAATTGTTTAAAAACAGCTTAGGGATTAATGTCCTCCA 
^ TTTATAAT GAAGATTAAAATGAAGGCTT TAATCAGCATTGTAAAGGAAATTGAAT GGCT TT CT GATAT 

GCTGTTTTTTAGCCTAGGAGTTAGAAATCCTAACTTCTTTATCCTCTTCTCCCAGAGGCTTTTTTTTT 
CTTGTGTATTAAATTAACATTTTTAAAACGCAGATATTTTGTCAAGGGGCTTTGCATTCAAACTGCTT 
TTCCAGGGCTATACTCAGAAGAAAGATAAAAGTGTGATCTAAGAAAAAGTGATGGTTTTAGGAAAGTG 
AAAATATTT TT GT TTT T GTAT TTGAAGAAGAAT GATGCAT T TTGACAAGAAATCATATAT GTAT GGAT 
ATATT TTAATAAGTAT T TGAGTACAGAC T TTGAGGTT TCATCAATATAAATAAAAGAGCAGAAAAATA 
TGTCTTGGTTTTCATTTGCTTACCAAAAAAACAACAACAAAAAAAGTTGTCCTTTGAGAACTTCACCT 
GCTCCTATGTGGGTACCTGAGTCAAAATTGTCATTTTTGTTCTGTGAAAAATAAATTTCCTTCTTGTA 
CCATTTCTGTTTAGTTTTACTAAAATCTGTAAATACTGTATTTTTCTGTTTATTCCAAATTTGATGAA 
ACTGACAATCCAATTTGAAAGTTTGTGTCGACGTCTGTCTAGCTTAAATGAATGTGTTCTATTTGCTT 
T ATACAT T TAT AT TAATAAAT TGTACAT T TT TCTAATT 
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FIGURE 120 



MATHALEIAGLFLGGVGMVGTVAVTVMPQWR 
IYDSLLALSPDLQAARGLMCAASVMSFI^ 

MWLIPVSWVANAIIRDFYNSIVNVAQKRELGEALYLGWTTALVLIVGGALFCCVFCCNEKSSSY 
RYSIPSHRTTQKSYHTGKKSPSVYSRSQYV 

Signal peptide: 

amino acids 1-17 

Transmembrane domains: 

amino acids 82-101, 118-145, 164-188 
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FIGURE 121 



GGAGAGAGGCGCGCGGGTGAAAGGCGCATTGATGCAGCCTGCGGCGGCCTCGGAGCGCGGCGGAG 
CCAGACGCTGACCACGTTCCTCTCCTCGGTCTCCTCCGCCTCCAGCTCCGCGCTGCCCGGCAGCC 
GGGAGCCATGCGACCCCAGGGCCCCGCCGCCTCCCCGCAGCGGCTCCGCGGCCTCCTGCTGCTCC 
TGCTGCTGCAGCTGCCCGCGCCGTCGAGCGCCTCTGAGATCCCCAAGGGGAAGCAAAAGGCGCAG 
CTCCGGCAGAGGGAGGTGGTGGACCTGTATAATGGAATGTGCTTACAAGGGCCAGCAGGAGTGCC 
TGGTCGAGACGGGAGCCCTGGGGCCAATGTTATTCCGGGTACACCTGGGATCCCAGGTCGGGATG 
GATTCAAAGGAGAAAAGGGGGAATGTC TGAGGGAAAGCT T T GAGGAGTCCT GGACACCCAACT AC 
Q AAGCAGT GT TCAT GGAGT T CAT T GAATTAT GGCATAGAT CT TGGGAAAAT T GCGGAGT G T ACAT T 

P TACAAAGATGCGTTCAAATAGTGCTCTAAGAGTTTTGTTCAGTGGCTCACTTCGGCTAAAATGCA 
rl GAAATGCATGCTGTCAGCGTTGGTATTTCACATTCAATGGAGCTGAATGTTCAGGACCTCTTCCC 
Tn AT TGAAGC TATAATTTATT TGGACCAAGGAAGCCCTGAAATGAAT TCAACAATTAATAT TCAT CG 

CACTTCTTCTGTGGAAGGACTTTGTGAAGGAATTGGTGCTGGATTAGTGGATGTTGCTATCTGGG 
HI T T GGCACTTGTT CAGAT TAC CCAAAAGGAGATGCT T C TACT GGATGGAAT TCAGT T TCT CGCATC 

5i AT TAT TGAAGAACTACCAAAATAAAT GC T T TAAT T T TCAT T T GC TACC TCTTTTTT TAT TAT GCC 

!% T TGGAATGGT T CACT TAAAT GACATT T TAAATAAGT T TATGTATACAT C TGAATGAAAAGCAAAG 

jSi CTAAAT AT GT TTACAGACCAAAGT GT GATT TCACACT GT T T T T AAATC TAGCAT TAT TCAT T T TG 

!|| C T TCAATCAAAAGT GGT TT CAATAT T T T T T T TAGT T GGTTAGAATAC T T TC TTCATAGT CACATT 

p CTCTCAACCTATAATTTGGAATATTGTTGTGGTCTTTTGTTTTTTCTCTTAGTATAGCATTTTTA 
^ AAAAAATATAAAAGC TACCAATC T T TGTACAAT T TGTAAATGTTAAGAATT T T T T T T ATATCTGT 

TAAATAAAAAT TATT TCCAACA 
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FIGURE 122 



MRPQGPAASPQRLRGLLLLLLLQLPAPSSASE I PKGKQKAQLRQREWDLYNGMCLQGPAGVPGR 
DGSPGANVIPGTPGIPGRDGFKGEKGECLRESFEESWTPNYKQCSWSSLNYGIDLGKIAECTFTK 
MRSNSALRVLFSGSLRLKCRNACCQRWYFTFNGAECSGPLPIEAIIYLDQGSPEMNSTINIHRTS 
SVEGLCEGIGAGLVDVAIWVGTCSDYPKGDASTGWNSVSRIIIEELPK 

Signal peptide: 

amino acids 1-30 

Transmembrane domain: 

amino acids 195-217 
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FIGURE 123 



GCTGAGCGTGTGCGCGGTACGGGGCTCTCCTGCCTTCTGGGCTCCAACGCAGCTCTGTGGCTGAA 
CTGGGTGCTCATCACGGGAACTGCTGGGCTATGGAATACAGATGTGGCAGCTCAGGTAGCCCCAA 
AT TGCCTGGAAGAATACAT CATGTTT TTCGATAAGAAGAAATTGTAGGATCCAGT TTTT TT TT TA 
ACCGC CCCC TCCCCAC CCCCCAAAAAAAC T GTAAAGATGCAAAAACG TAATATCCAT GAAGAT CC 
TATTACCTAGGAAGATTTTGATGTTTTGCTGCGAATGCGGTGTTGGGATTTATTTGTTCTTGGAG 
TGTTCTGCGTGGCTGGCAAAGAATAATGTTCCAAAATCGGTCCATCTCCCAAGGGGTCCAATTTT 
TCTTCCTGGGTGTCAGCGAGCCCTGACTCACTACAGTGCAGCTGACAGGGGCTGTCATGCAACTG 
GCCCC TAAGCCAAAGCAAAAGACCTAAGGACGACCTT TGAACAATACAAAGGATGGGT T TCAATG 
TAATTAGGCTACTGAGCGGATCAGCTGTAGCACTGGTTATAGCCCCCACTGTCTTACTGACAATG 
j*b CTTTCTTCTGCCGAACGAGGATGCCCTAAGGGCTGTAGGTGTGAAGGCAAAATGGTATATTGTGA 
f*i ATCTCAGAAATTACAGGAGATACCCTCAAGTATATCTGCTGGTTGCTTAGGTTTGTCCCTTCGCT 
*S- ATAACAGCCT T CAAAAACT TAAGTATAATCAAT T TAAAGGGCTCAACCAGCT CAC CTGGCTATAC 

CT TGACCATAACCATATCAGCAATATTGACGAAAAT GC TT TTAATGGAATACGCAGACTCAAAGA 
5:rl GCTGATTC TTAGT TCCAATAGAATC TCCTATTTTCT TAACAATACCT TCAGACCT GTGACAAATT 

Iff TACGGAACTTGGATCTGTCCTATAATCAGCTGCATTCTCTGGGATCTGAACAGTTTCGGGGCTTG 
m CGGAAGCTGCTGAGTTTACATTTACGGTCTAACTCCCTGAGAACCATCCCTGTGCGAATATTCCA 
$j AGAC TGCCGCAACCT GGAACT T T TGGACC TGGGATAT AACCGGATCCGAAG T T TAGCCAGGAATG 

HI TC TT TGC TGGCAT GAT CAGACTCAAAGAAC T TCACCT GGAGCACAATCAAT TTT CCAAGCT CAAC 

I'll CTGGCCCTTTTTCCAAGGTTGGTCAGCCTTCAGAACCTTTACTTGCAGTGGAATAAAATCAGTGT 
;? CATAGGACAGACCAT GTCCT GGACCTGGAGC TCCT TACAAAGGC T TGAT TTATCAGGCAAT GAGA 

fn TCGAAGCTTTCAGTGGACCCAGTGTTTTCCAGTGTGTCCCGAATCTGCAGCGCCTCAACCTGGAT 
lS TCCAACAAGCTCACATTTATTGGTCAAGAGATTTTGGATTCTTGGATATCCCTCAATGACATCAG 
^ TCTTGCTGGGAATATATGGGAATGCAGCAGAAATATTTGCTCCCTTGTAAACTGGCTGAAAAGTT 
fcl T TAAAGGTC TAAGGGAGAAT ACAAT TATC TGTGC CAGT CCCAAAGAGC T GCAAGGAGTAAATGTG 

|f| ATCGATGCAGTGAAGAACTACAGCATCTGTGGCAAAAGTACT ACAGAGAGGT TT GATCTGGCCAG 

Q GGCTCTCCCAAAGCCGACGTTTAAGCCCAAGCTCCCCAGGCCGAAGCATGAGAGCAAACCCCCTT 
SI TGCCCCCGACGGTGGGAGCCACAGAGCCCGGCCCAGAGACCGATGCTGACGCCGAGCACATCTCT 
l?l TTCCATAAAATCATCGCGGGCAGCGTGGCGCTTTTCCTGTCCGTGCTCGTCATCCTGCTGGTTAT 
CTACGTGTCATGGAAGCGGTACCCTGCGAGCATGAAGCAGCTGCAGCAGCGCTCCCTCATGCGAA 
GGCACAGGAAAAAGAAAAGACAGTCCC TAAAGCAAATGACTCCCAGCACGCAGGAATT TTATGTA 
GATTATAAACCCACCAACACGGAGACCAGCGAGATGCT GCTGAATGGGACGGGACCCT GCACCTA 
TAACAAATCGGGC TCCAGGGAGT GTGAGGT ATGAA CCAT T G T GATAAAAAGAGCT CT TAAAAGC T 
GGGAAATAAGTGGTGCTTTATTGAACTCTGGTGACTATCAAGGGAACGCGATGCCCCCCCTCCCC 
TTCCCTCTCCCTCTCACTTTGGTGGCAAGATCCTTCCTTGTCCGTTTTAGTGCATTCATAATACT 
GGTCATTTTCCTCTCATACATAATCAACCCATTGAAATTTAAATACCACAATCAATGTGAAGCTT 
GAAC T C CG G T T T AAT AT AATAC C TAT T G TATAAGACC C T T T AC T GAT T C CAT T AAT GT C GC AT T T 
GTTTTAAGATAAAACTTCT TTCATAGGTAAAAAAAAAAA 
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FIGURE 124. 

MGFNVIRLLSGSAVALVIAPTVLLTMLSSAERGCPKGCRCEGKMVYCESQKLQEIPSSISAGCLG 

LSLRYNSLQKLKYNQFKGLNQLTWLYLDHNHISNIDENAFNGIRRLKELILSSNRISYFLNNTFR 

PVTNLRNLDLSYNQLHSLGSEQFRGLRKLLSLHLRSNSLRTIPVRIFQDCRNLELLDLGYNRIRS 

IAROTFAGMIRLKELHLEHNQFSKLNIALFPRLVSLQNLYLQWNKISVIGQTMSW 

SGNEIEAFSGPSVFQCVPNLQRLNLDSNKLTFIGQEILDSWISLNDISLAGNIWECSRNICSLVN 

WLKSFKGLRENTIICASPKELQGVWIDAVKNYSICGKSTTERFDLARALPKPTFKPKLPRPKHE 

SKPPLPPTVGATEPGPETDADAEHIS FHKI IAGSVALFLSVLVILLVI YVSWKRYPASMKQLQQR 

SLMRRHRKKKRQSLKQMTPSTQEFYVDYKPTNTETSEMLLNGTGPCTYNKSGSRECEV 

Important features of the protein: 
Signal peptide: 

amino acids 1-33 

Transmembrane domain: 

amino acids 420-442 

N-glycosylation sites. 

amino acids 126-129, 357-360, 496-499, 504-507 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 4 65-468 

Tyrosine kinase phosphorylation site. 

amino acids 136-142 

N-myristoylation sites. 

amino acids 11-16, 33-38, 245-250, 332-337, 497-502, 507-512 
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FIGURE 125 

CCGTTATCGTCTTGCGCTACTGCTGAATGTCCGTCCCGGAGGAGGAGGAGAGGCTTTTGCCGCTG 
ACCCAGAGATGGCCCCGAGCGAGCAAATTCCTACTGTCCGGCTGCGCGGCTACCGTGGCCGAGCT 
AGCAACCTTTCCCCTGGATCTCACAAAAACTCGACTCCAAATGCAAGGAGAAGCAGCTCTTGCTC 
GGTTGGGAGACGGTGCAAGAGAATCTGCCCCCTATAGGGGAATGGTGCGCACAGCCCTAGGGATC 
AT TGAAGAGGAAGGCTTTCTAAAGCT TTGGCAAGGAGTGACACCCGCCATT TACAGACACGTAGT 
GTATTCTGGAGGTCGAATGGTCACATATGAACATCTCCGAGAGGTTGTGTTTGGCAAAAGTGAAG 
l,t ; ATGAGCATTATCCCCTTTGGAAATCAGTCATTGGAGGGATGATGGCTGGTGTTATTGGCCAGTTT 
C| T TAGCCAAT CCAACT GACCTAGT GAAGGT TCAGAT GCAAAT GGAAGGAAAAAGGAAAC T GGAAGG 

AAAACCATTGCGATTTCGTGGTGTACATCATGCATTTGCAAAAATCTTAGCTGAAGGAGGAATAC 
'•ll GAGGGCTT TGGGCAGGGTGGGTACCCAATATACAAAGAGCAGCACTGGTGAATATGGGAGAT TTA 

||| ACCACT TATGAT ACAGT GAAACAC TAC TTGGTAT T GAAT ACACCACTTGAGGACAATAT CAT GAC 

|f| TCACGGTTTATCAAGTTTATGTTCTGGACTGGTAGCTTCTATTCTGGGAACACCAGCCGATGTCA 
I'll TCAAAAGCAGAATAAT GAAT CAACCACGAGAT AAACAAGGAAGGGGAC T T T T GT ATAAAT CAT CG 

* AC TGAC TGC T TGAT T CAGGC TGT TCAAGGTGAAGGAT T CAT GAG T C T AT AT AAAG G C T T T T TACC 

r! 

ATCTTGGCT GAGAAT GACC CCT T GGT CAATGGTGTTCT GGC T TAC T TATGAAAAAATCAGAGAGA 
|*t TGAGTGGAGTCAGTCCATTT TAA. 

iU 
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FIGURE 126 



\ 



hi 

13 



MSVPEEEERLLPLTQRWPRASKFLLSGCAATV7VELATFPLDLTKTRLQMQGEAALARLGDGARES 
APYRGMVRTALGIIEEEGFLKLWQGVTPAIYRHVVYSGGRMVTYEHLREWFGKSEDEHYPLWKS 
VIGGMMAGVIGGFIAlNfPTDLW 

NIQRAALVNMGDLTTYDTVKHYLVLNTPLEDNIMTHGLSSLCSGLVASILGTPADVIKSRIMNQP 
RDKQGRGLLYKSSTDCLIQAVQGEGFMSLYKGFLPSWLRMTPWSMVFWLTYEKIREMSGVSPF 

Transmembrane domains: 

amino acids 25-38, 130-147, 233-248 
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FIGURE 127 

CGCGGATCGGACCCAAGCAGGTCGGCGGCGGCGGCAGGAGAGCGGCCGGGCGTCAGCTCCTCGAC 

CCCCGTGTCGGGCTAGTCCAGCGAGGCGGACGGGCGGCGTGGGCCCATGGCCAGGCCCGG.CATGG 

AGCGGTGGCGCGACCGGCTGGCGCTGGTGACGGGGGCCTCGGGGGGCATCGGCGCGGCCGTGGCC 

CGGGCCCTGGTCCAGCAGGGACTGAAGGTGGTGGGCTGCGCCCGCACTGTGGGCAACATCGAGGA 

GCTGGC TGC TGAATGTAAGAGT GCAGGC TACCCCGGGACT T T GAT CCCC TACAGAT GT GACCTAT 

CAAATGAAGAGGACATC CT C TCCATGT TCT CAGC TAT CCGT TC TCAGCACAGCGGTGTAGACATC 

|„l TGCATCAACAATGCTGGCTTGGCCCGGCCTGACACCCTGCTCTCAGGCAGCACCAGTGGTTGGAA 

Q GGACATGTTCAATGTGAACGTGCTGGCCCTCAGCATCTGCACACGGGAAGCCTACCAGTCCATGA 

M AG GAGC GGAAT G T G GAC GAT G GGCACAT CAT T AAC AT CAAT AGCAT G T C T GGC CACC GAG T G T T A 

J*! CCCCTGTCTGTGACCCACTTCTATAGTGCCACC7^AGTATGCCGTCACTGCGCTGACAGAGGGACT 

III GAGGCAAGAGCTTCGGGAGGCCCAGACCCACATCCGAGCCACGTGCATCTCTCCAGGTGTGGTGG 

s ft AGACACAAT TCGCC TT CAAACTCCACGACAAGGACCC T GAGAAGGCAGC TGCCACC TAT GAGCAA 

III ATGAAGTGTCTCAAACCCGAGGATGTGGCCGAGGCTGTTATCTACGTCCTCAGCACCCCCGCACA 

CATCCAGAT T GGAGACATCCAGAT GAGGCCCAC GGAGCAGGT GACC TAG T GAC T GT GGGAGCTCC 
r! 

TCCTTCCCTCCCCACCCTTCATGGCTTGCCTCCTGCCTCTGGATTTTAGGTGTTGATTTCTGGAT 
«K CACGGGATACCACTTCCTGTCCACACCCCGACCAGGGGCTAGAAAATTTGTTTGAGATTTTTATA 
!|| TCATCTTGTCAAATTGCTTCAGTTGTAAATGTGAAAAATGGGCTGGGGAAAGGAGGTGGTGTCCC 
Q TAATTGTTTTACTTGTTAACTTGTTCTTGTGCCCCTGGGCACTTGGCCTTTGTCTGCTCTCAGTG 
TCTTCCCTTTGACATGGGAAAGGAGTTGTGGCCAAAATCCCCATCTTCTTGCACCTCAACGTCTG 
TGGCTCAGGGCTGGGGTGGCAGAGGGAGGCCTTCACCTTATATCTGTGTTGTTATCCAGGGCTCC 
AGACTTCCTCCTCTGCCTGCCCCACTGCACCCTCTCCCCCTTATCTATCTCCTTCTCGGCTCCCC 
AGCCCAGTCTTGGCTTCTTGTCCCCTCCTGGGGTCATCCCTCCACTCTGACTCTGACTATGGCAG 
CAGAACACCAGGGCCTGGCCCAGTGGATTTCATGGTGATCATTAAAAAAGAAAAATCGCAACCAA 
AAAAAAAAAA 
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FIGURE 128 



MARPGMERWRDRLALVTGASGGIGAAV^ 

PYRCDLSNEEDILSMFSAIRSQHSGVDICINNAGIARPDTLLSGST^^ 

EAYQSMKERNVDDGHIININSMSGHRVLPLSVTHFYSATKYAVTALTEGLRQELREAQTHIRATC 
I S PGWETQFAFKLHDKDPEKAAAT YE QMKC LKPE DVAE AV I YVLS TPAHI Q I GD I QMRP TE QVT 

Important features of the protein: 
Signal peptide: 

amino acids 1-17 

N-myristoylation sites. 

amino acids 18-24, 21-27, 22-28, 24-30, 40-46, 90-96, 109-115, 
199-205 

Short-chain alcohol dehyrogenase. 

amino acids 30-42, 104-114 
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FIGURE 12Q 

AACTTCTACATGGGCCTCCTGCTGCTGGTGCTCTTCCTCAGCCTCCTGCCGGTGGCCTACACCAT 
CATGTCCCTCCCACCCTCCTTTGACTGCGGGCCGTTCAGGTGCAGAGTGTCAGTTGCCCGGGAGC 
ACCTCCCCTCCCGAGGCAGTCTGCTCAGAGGGCCTCGGCCCAGAATTCCAGTTCTGGTTTCATGC 
CAGCCTGTAAAAGGCCATGGAACTTTGGGTGAATCACCGATGCCATTTAAGAGGGTTTTCTGCCA 
GGATGGAAATGTTAGGTCGTTCTGTGTCTGCGCTGTTCATTTCAGTAGCCACCAGCCACCTGTGG 
CCGTTGAGTGCTTGAAATGAGGAACTGAGAAAATTAATTTCTCATGTATTTTTCTCATTTATTTA 
T T AAT T T T TAAC T GAT AG T T G T ACAT AT T T GG GG G T ACAT G T GAT AT T T GGAT ACAT G T AT ACAA 
gj TATATAAT GATCAAAT CAGGGTAAC T GGGATAT CCAT CACAT CAAACAT T TAT T T T T TAT TCTT T 

CI T TAGACAGAGTCT CACTCTGTCACCCAGGC T GGAGTGCAGT GGTGCCATC TCAGCT TAC TGCAAC 

0= CTCTGCCTGCCAGGTTCAAGCGATTCTCATGCCTCCACCTCCCAAGTAGCTGGGACTACAGGCAT 
X GCACCACAATGCCCAACTAATTTTTGTATTTTTAGTAGAGACGGGGTTTTGCCATGTTGCCCAGG 
J\ CTGGCCTTGAACTCCTGGCCTCAAACAATCCACTTGCCTCGGCCTCCCAAAGTGTTATGATTACA 
flf GGCGTGAGCCACCGTGCCTGGCCTAAACATTTATCTTTTCTTTGTGTTGGGAACTTTGAAATTAT 

* ACAATGAATTATTGTTAACTGTCATCTCCCTGCTGTGCTATGGAACACTGGGACTTCTTCCCTCT 

fit 

ATCTAACTGTATATTTGTACCAGTTAACCAACCGTACTTCATCCCCACTCCTCTCTATCCTTCCC 

J* AACCTCTGATCACCTCATTCTACTCTCTACCTCCATGAGATCCACTTTTTTAGCTCCCACATGTG 

iJl AGTAAGAAAATGCAATATTTGTCTTTCTGTGCCTGGCTTATTTCACTTAACATAATGACTTCCTG 

Q TTCCATCCATGT T GCT GCAAATGACAGGAT T TCGT TCT TAAT TTCAAT TAAAATAACCACACAT G 

W GCAAAAA 
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FIGURE ISO 

MGLLLLVLFLSLLPVAYTIMSLPPSFDCGPFRCRVSVAREHLPSRGSLLRGPRPRIPVLVSCQPV 
KGHGTLGESPMPFKRVFCQDGNVRSFCVCAVHFSSHQPPVAVECLK 

Important features of the protein: 
Signal peptide : 

amino acids 1-18 

N-myristoylation site. 

amino acids 8 6-92 

Zinc carboxypeptidases , zinc-binding region 2 signature. 

amino acids 68-79 
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FIGURE 131 

TT CT GAAGTAACGG AAGCTACCTT GT AT AAAGACCT CAAC AC TGCT GACC ATGATC AGCGC AGC CT GGAGC 
ATCTTCCTCATCGGGACTAAAATTGGGCTGTTCCTTCAAGTAGCACCTCTATCAGTTATGGCTAAATCCTG 
TCCATCTGTGTGTCGCTGCGATGCGGGTTTCATTTACTGTAATGATCGCTTTCTGACATCCATTCCAACAG 
GAAT ACCAGAGGAT GCT AC AAC TC TCT AC CT T CAGAAC AACC AAAT AAAT AATG CT GGGAT T CCTT CAGAT 
TT GAAAAACT T GCT GAAAGT AGAAAG AAT AT ACCT ATACC AC AACAGT TT AG AT GAATTT CCT ACCAACCT 
CC CAAAGT AT GT AAAAGAGT T ACATT TGCAAGAAAAT AAC AT AAGGAC TAT C AC T TAT GATTCACT TTCAA 
AAAT TCCC TAT CTGGAAGAAT T AC ATT T AG AT GAC AACTCTGT CT C TGC AGT T AGC AT AGAAGAGG GAGC A 
TTCCGAGACAGCAACTATCTCCGACTGCTTTTCCTGTCCCGTAATCACCTTAGCACAATTCCCTGGGGTTT 
£J| GCCC AG GACT AT AGAAGAACT ACGCT T GGAT GAT AAT C GCAT AT C C ACT ATT T CAT CACC AT CT CT TC AAG 

CI GTCTCACTAGTCTAAAACGCCTGGTTCTAGATGGAAACCTGTTGAACAATCATGGTTTAGGTGACAAAGTT 

Hi 

h?* TT CT TC AACCT AGT T AATTTGACAGAGCTGTCCCTGGTGC GGAAT T CCCT GACT GCTGCACC AGT AAAC CT 

;|f TCCAGGC AC AAACCTGAGGAAGCTTT AT CT TC AAGAT AAC CAC ATC AATCGGGT GCCCCC AAAT GCTTTT T 

'H? CTTATCTAAGGCAGCTCTATCGACTGGATATGTCCAATAATAACCTAAGTAATTTACCTCAGGGTATCTTT 

a f I 

2| GATGATTT GGACAAT AT AAC AC AACTGATT CTTCGC AACAAT CCCT GGTAT T GC GGGT GCAAGATGAAATG 

GGTACGT GACT GGT TACAAT CACT ACCT GT GAAGGTCAACGT GCGT GGGCT C AT GTGCCAAGCCCC AGAAA 
Q AGGTTCGT GGGATGGCT AT T AAGGAT CT C AAT GC AG AACT GT TTGATT GT AAGGAC AGTGGGAT TGT AAGC 

||! ACCATTCAGATAACCACTGCAATACCCAACACAGTGTATCCTGCCCAAGGACAGTGGCCAGCTCCAGTGAC 
|3 C AAAC AGC CAGAT ATT AAGAACCCCAAGCTCACT AAGGAT CAAC AAACC AC AGGGAGT CC CT CAAGAAAAA 

If! C AAT TACAAT TACT GT GAAGTCT GTCAC CT CTGAT ACC AT TC AT AT CT CT TGGAAACT TG CT CT ACCT AT G 

W* ACTGCTTTGAGACTCAGCTGGCTTAAACTGGGCCATAGCCCGGCATTTGGATCTATAACAGAAACAATTGT 
^ AACAGGGGAACGCAGTGAGTACTTGGTCACAGCCCTGGAGCCTGATTCACCCTATAAAGTATGCATGGTTC 
CCAT GGAAACCAGC AACCTCTACCT ATT TG AT GAAACT CC TGTT TGT ATTGAGACT GAAACT GC ACCCCTT 
CGAATGT AC AACCCT AC AACCACC CT C AATCGAG AGC AAG AG AAAG AACC TTACAAAAACCCCAAT TT ACC 
TTTGGCTGCCATCATTGGTGGGGCTGTGGCCCTGGTTACCATTGCCCTTCTTGCTTTAGTGTGTTGGTATG 
TTCATAGGAATGGATCGCTCTTCTCAAGGAACTGTGCATATAGCAAAGGGAGGAGAAGAAAGGATGACTAT 
GCAGAAGCTGGCACTAAGAAGGACAACTCTATCCTGGAAATCAGGGAAACTTCTTTTCAGATGTTACCAAT 
AAGC AAT GAACCCATCT CGAAG GAGGAGTTT GT AAT AC AC ACCAT AT T TCCT CCT AAT GGAAT G AATCTGT 
AC AAAAAC AAT CAC AGT GAAAG CAGT AGTAACCG AAGC T AC AGAGAC AGT GGTAT T CC AG AC TC AG AT CAC 
TC AC ACT C ATGAT GCT GAAGGACT C ACAGCAGACTTGT GT T T TGGGT T T T TT AAACCT AAGGGAGGTG ATG 
GT 
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FIGURE 132 

MISAAWSIFLIGTKIGLFLQVAPLSVMAKSCPSVCRCDAGFIYCNDRFLTSIPTGIPEDATTLYL 
QNNQINNAGIPSDLKNLLKVERIYLYHNSLDEFPTNLPKYVKELHLQENNIRTITYDSLSKIPYL 
EELHLDDNSVSAVSIEEGAFRDSNYLRLLFLSRNHLSTIPWGLPRTIEELRLDDNRISTISSPSL 
QGLTSLKRLVLDGNLLNNHGLGDKVFFNLVNLTELSLVRNSLTAAPVNLPGTNLRKLYLQDNHIN 
RVPPNAFSYLRQLYRLDMSNNNLSNLPQGIFDDLDNITQLILRNNPWYCGCKMKWVRDWLQSLPV 
KVNVRGLMCQAPEKVRGMAIKDLNAELFDCKDSGIVSTIQITTAI PNTVYPAQGQWPAPVTKQPD 
I KNPKLTKDQQTTGS PSRKT I T I T VKS VTS DT I H I SWKLALPMTALRLS WLKLGHSPAFGS I TET 
IVTGERSEYLVTALEPDSPYKVCMVPMETSNLYLFDETPVCIETETAPLRMYNPTTTLNREQEKE 
PYKNPNLPLAAI IGGAVALVT IALLALVCWYVHRNGSLFSRNCAYSKGRRRKDDYAEAGTKKDNS 
ILEIRETSFQMLPISNEPISKEEFVIHTIFPPNGMNLYKNNHSESSSNRSYRDSGIPDSDHSHS 

Important features of the protein: 
Signal peptide: 

amino acids 1-28 
Transmembrane domain: 
amino acids 531-552 
N-glyeosylation sites. 

amino acids 226-229, 282-285, 296-299, 555-558, 626-629, 633-636 
Tyrosine kinase phosphorylation site, 
amino acids 515-522 
N-myristoylation sites. 

amino acids 12-17, 172-177, 208-213, 359-364, 534-539, 556-561, 
640-645 

Amidation site. 

amino acids 567-570 
Leucine zipper pattern. 

amino acids 159-180 

Phospholipase A2 aspartic acid active site. 

amino acids 34-44 
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FIGURE 133 



CCGTCATCCCCCTGCAGCCACCCTTCCCAGAGTCCTTTGCCCAGGCCACCCCAGGCTTCTTGGCA 
GCCCTGCCGGGCCACTTGTCTTCATGTCTGCCAGGGGGAGGTGGGAAGGAGGTGGGAGGAGGGCG 
TGGAGAGGCAGTCTGGGCTTGGCCAGAGCTCAGGGTGCTGAGCGTGTGACCAGCAGTGAGCAGAG 
GCCGGCCATGGCCAGCCTGGGGCTGCTGCTCCTGCTCTTACTGACAGCACTGCCACCGCTGTGGT 
CCTCCTCACTGCCTGGGCTGGACACTGCTGAAAGTAAAGCCACCATTGCAGACCTGATCCTGTCT 
GCGCTGGAGAGAGCCACCGTCTTCCTAGAACAGAGGCTGCCTGAAATCAACCTGGATGGCATGGT 
GGGGGTCCGAGTGCTGGAAGAGCAGCTAAAAAGTGTCCGGGAGAAGTGGGCCCAGGAGCCCCTGC 
T GCAGC CGC T G AGC C T G CG C G T GGGGAT GC T GG GG GAGAAGC T G GAGGC TG C CAT C CAGAG AT CC 
C T CCAC TACCTCAAGCTGAGT GAT CCCAAGTACCTAAGAGAGT TCCAGC TGACCC T CCAGCCCGG 
GTTTTGGAAGCTCCCACATGCCTGGATCCACACTGATGCCTCCTTGGTGTACCCCACGTTCGGGC 
5 5 |f CCCAGGACTCATTCTCAGAGGAGAGAAGTGACGTGTGCCTGGTGCAGCTGCTGGGAACCGGGACG 
GACAGCAGCGAGCCCTGCGGCCTCTCAGACCTCTGCAGGAGCCTCATGACCAAGCCCGGCTGCTC 
fll AGGCTACTGCCTGTCCCACCAACTGCTCTTCTTCCTCTGGGCCAGAATGAGGGGATGCACACAGG 
« GACCAC TCCAACAGAGCCAGGAC TATAT CAACCTCTT CTGCGCCAACAT GAT GGACT TGAACCGC 

U AGAGCTGAGGCCATCGGATACGCCTACCCTACCCGGGACATCTTCATGGAAAACATCATGTTCTG 
f£ TGGAATGGGCGGCTTCTCCGACTTCTACAAGCTCCGGTGGCTGGAGGCCATTCTCAGCTGGCAGA 
?5 AACAGCAGGAAGGAT GC TT CGGGGAGCC TGAT GCTGAAGAT GAAGAAT TATC TAAAGCTAT TCAA 

F\ TATCAGCAGCAT T T T T C GAGGAGAGT GAAGAGGCGAGAAAAACAAT T T C CAGAT T CTCGC T CT GT 

1!?" 

Ill TGCTCAGGCTGGAGTACAGTGGCGCAATCTCGGCTCACTGCAACCT TTGCCTCCTGGGTTCAAGC 

AATTCTCTTGCCTCATCCTCCCGAGTAGCTGGGACTACAGGAGCGTGCCACCATACCTGGCTAAT 
TTTTATATTTTTTTAGTAGAGACAGGGTTTCATCATGTTGCTCATGCTGGTCTCGAACTCCTGAT 
CTCAAGAGATCCGCCCACCTCAGGCTCCCAAAGTGTGGGATT ATAG GTGTGAGCCACCGTGTCTG 
GCTGAAAAGCACTTTCAAAGAGACTGTGTTGAATAAAGGGCCAAGGTTCTTGCCACCCAGCACTC 
ATGGGGGCTCTCTCCCCTAGATGGCTGCTCCTCCCACAACACAGCCACAGCAGTGGCAGCCCTGG 
GTGGCTTCCTATACATCCTGGCAGAATACCCCCCAGCAAACAGAGAGCCACACCCATCCACACCG 
CCACCACCAAGCAGCCGCTGAGACGGACGGT T CCAT GCCAGCTGCC TGGAGGAGGAACAGACCCC 
TTTAGTCCTCATCCCTTAGATCCTGGAGGGCACGGATCACATCCTGGGAAGAAGGCATCTGGAGG 
ATAAGCAAAGCCACCCCGACACCCAATCTTGGAAGCCCTGAGTAGGCAGGGCCAGGGTAGGTGGG 
GGCCGGGAGGGAC CCAGGT GT GAACGGATGAATAAAGT T CAAC TGCAAC T GAAAAAAAAAAA 
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FIGURE 134. 



MSARGRWEGGGRRACRGSLGLARAQGAERVTSSEQRPAMASLGLLLLLLLTALPPLWSSSLPGLD 
TAESKAT IADLILSALERATVFLEQRLPE INLDGMVGVRVLEEQLKSVREKWAQEPLLQPLSLRV 
GMLGEKLEAAI QRS LHYLKLSDPKYLRE FQLT LQPG FWKLPHAW I HT DASLVYPT FGPQDS FSEE 
RSDVCLVQLLGTGTDSSEPCGLSDLCRSLMTKPGCSGYCLSHQLLFFLWARMRGCTQGPLQQSQD 
YINLFCANMMDLNRRAEAIGYAYPTRDIFMENIMFCGMGGFSDFYKLRWLEAILSWQKQQEGCFG 
EPDAEDEELSKAIQYQQHFSRRVKRREKQFPDSRSVAQAGVQWRNLGSLQPLPPGFKQFSCLILP 
SSWDYRSVPPYIANFYIFLVETGFHHVAHAGLELLISRDPPTSGSQSVGL 

Important features of the protein: 
Signal peptide: 

amino acids 1-2 6 

Transmembrane domain: 

amino acids 39-56 

Tyrosine kinase phosphorylation sites. 

amino acids 149-156, 274-282 

N-myristoylation sites. 

amino acids 10-16, 20-26, 63-69, 208-214 

Amidation site. 

amino acids 10-14 

Glycoprotein hormones beta chain signature 1 . 

amino acids 230-237 
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FIGURE 135 

6GTCTGAGTGCAGAGCTGCTGTC ATG GCGGCCGCTCTGTGGGGCTTCTTTCCCGTCCTGCTGCTG 
CTGCTGCTATCGGGGGATGTCCAGAGCTCGGAGGTGCCCGGGGCTGCTGCTGAGGGATCGGGAGG 
GAGTGGGGTCGGCATAGGAGATCGCTTCAAGATTGAGGGGCGTGCAGTTGTTCCAGGGGTGAAGC 
CTCAGGACTGGATCTCGGCGGCCCGAGTGCTGGTAGACGGAGAAGAGCACGTCGGTTTCCTTAAG 
ACAGATGGGAGTTTTGTGGTTCATGATATACCTTCTGGATCTTATGTAGTGGAAGTTGTATCTCC 
AGCT TACAGATT TGATCCCGTTCGAGTGGATATCACT TCGAAAGGAAAAATGAGAGCAAGATATG 
TGAATTACATCAAAACATCAGAGGTTGTCAGACTGCCCTATCCTCTCCAAATGAAATCTTCAGGT 
CCACC TTC T TAC T T T ATT AAAAGGGAATCGTGGGGCTGGACAGAC T T T CTAATGAACCCAAT GGT 
TATGATGATGGTTCTTCCTTTATTGATATTTGTGCTTCTGCCTAAAGTGGTCAACACAAGTGATC 
C TGACAT GAGACGGGAAAT GGAGCAG TCAATGAATAT GCT GAAT TCCAACCATGAGT T GCCT GAT 
GT TTC TGAGT T CAT GACAAGAC TC TT CTCT T CAAAAT CAT C T GGCAAATCTAGCAGCGGCAGCAG 
TAAAACAGGCAAAAGT GGGGCT GGCAAAAGGAGGTAGT CAGGCCGTCCAGAGCT GGCAT T T GCAC 
AAACACGGCAACACTGGGTGGCATCCAAGTCTTGGAAAACCGTGTGAAGCAACTACTATAAACTT 
GAGT CATCC CGAC GT TGAT CTC T TACAACT GTGTATGT T 

AAC T T T T TAGCACATGTT T T GT ACT T GGTACACGAGAAAACCCAGC TT TCAT C T T T TGTC TGTAT 
GAGGTCAATAT T GAT GT CACTGAAT TAAT TACAGTGTCCTATAGAAAATGCCAT TAAT AAAT TAT 
ATGAAC TAC TATACAT TAT GT AT AT TAAT TAAAACATCTT AATCCAGAAAT CAAAAAAAAAAAAA 
AAAAAAAAAAAAAAA 



FIGURE 136 

MAAALWGFFPVLLLLLLSGDVQSSEVPGAAAEGSGGSGVGIGDRFKIEGRAWPGVKPQDWISAA 

RVLVDGEEHVGFIJCTDGSFVVHDIPS 

VVRLPYPLQMKSSGPPSYFIKRESWGWTDFIMNPMVMMW 

QSMNMLNSNHELPDVSEFMTRLFSSKSSGKSSSGSSKTGKSGAGKRR 

Important features of the protein: 
Signal sequence: 

amino acids 1-23 

Transmembrane domain: 

amino acids 161-182 

N-glycosylation site. 

amino acids 184-187 

Glycosaminoglycan attachment sites . 

amino acids 37-40, 236-239 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 151-154 

N-myristoylation sites. 

amino acids 33-38, 36-41, 38-44, 229-234 

Amidation site. 

amino acids 238-241 

ATP/GTP-binding site motif A (P-loop) . 

amino acids 229-236 
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FIGURE 137 

GATGGCGCAGCCACAGCTTCTGTGAGATTCGATTTCTCCCCAGTTCCCCTGTGGGTCTGAGGGGA 
C C AGAAGGGTGAGCTACGT T GGC T T T C TGGAAGGGGAGGC T ATATGCG TCAATTCC CCAAAACAA 
GTTTTGACATTTCCCCTGAAATGTCATTCTCTATCTATTCACTGCAAGTGCCTGCTGTTCCAGGC 
CTTACCTGCTGGGCACTAACGGCGGAGCCAGGATGGGGACAGAATAAAGGAGCCACGACCTGTGC 
CACCAACTCGCACTCAGACTCTGAACTCAGACCTGAAATCTTCTCTTCACGGGAGGCTTGGCAGT 
TTTTCTTACTCCTGTGGTCTCCAGATTTCAGGCCTAAGATGAAAGCCTCTAGTCTTGCCTTCAGC 
CTTCTCTCTGCTGCGTTTTATCTCCTATGGACTCCTTCCACTGGACTGAAGACACTCAATTTGGG 
%l AAGC TGTGT GAT CGCCACAAACC T TCAGGAAATACGAAATGGAT T T TCTGAGATACGGGGCAGTG 

f'| T GCAAGCCAAAGAT GGAAACATTGACATCAGAAT C TT AAGGAGGAC TGAGTC TT T GCAAGACACA 

p AAGCCTGCGAATCGATGCTGCCTCCTGCGCCATTTGCTAAGACTCTATCTGGACAGGGTATTTAA 
W AAACTACCAGACCCCTGACCATTATACTCTCCGGAAGATCAGCAGCCTCGCCAATTCCTTTCTTA 
j|>! CCATCAAGAAGGACCTCCGGCTCTCTCATGCCCACATGACATGCCATTGTGGGGAGGAAGCAATG 
f|| AAGAAATACAGCCAGAT TCT GAGTCAC T TTGAAAAGCT GGAACCT CAGGCAGCAGT TGT GAAGGC 

, T T TGGGGGAACTAGACAT TC T TC T GCAATGGAT GGAGGAGACAGAATAGGAGGAAAGTGATGCTG 

CI C TGCTAAGAATAT T CGAGGT CAAGAGC T CCAGTCTT CAAT ACCT GCAGAGGAGGCAT GACCCCAA 

*H ACCACCATCTCTTTACTGTACTAGTCTTGTGCTGGTCACAGTGTATCTTATTTATGCATTACTTG 
^! C T TCC T TGCATGAT TGT CT T TAT GCATCCCCAAT CT TAATT GAGACCATAC T TGTATAAGAT T T T 

1^ TG TAATATC T TT C T GCTAT T GGATATAT T TATT AGTTAAT ATAT T TAT T TAT T T T T T GC TAT T TA 

f|J AT GTAT TTAT TT T T T TACT T GGACATGAAAC T T TAAAAAAAT TCACAGATT ATAT TT AT AACCT G 

ACTAGAGCAGGTGATGTATT TTTATACAGTAAAAAAAAAAAACCTTGTAAAT TCTAGAAGAGTGG 
CTAGGGGGGTT AT T CAT T TGTATT CAAC TAAGGACAT AT T TAC TCATGCT GAT GCTC TGTGAGAT 
AT T TGAAAT TGAACCAAT GACTAC T T AGGATGGG T TGT GGAATAAGTT T T GATGT GGAAT TGCAC 
ATCTACCTTACAATTACTGACCATCCCCAGTAGACTCCCCAGTCCCATAATTGTGTATCTTCCAG 
CCAGGAATCC TACACGGCCAGCATGTAT T TC T ACAAAT AAAGT T T T CT T TGCATACCAAAAAAAA 
AAAAAAAAAAA 
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FIGURE 138 



MRQFPKT S FD I SPEMS FS I YSLQVPAVPGLTCWALTAEPGWGQNKGAT TCATNSHSDSE LRPE I F 
SSREAWQFFLLLWSPDFRPKMKASSLAFSLLSAAFYLLWTPSTGLKTLNLGSCVIATNLQEIRNG 

fseirgsvqakdgnidirilrrteslqdtkpanrccllrhllrlyldryfknyqtpdhytlrkis 
slansfltikkdlrlshahmtchc^^ 

E 

Important features of the protein: 
Signal peptide: 

amino acids 1-42 

cAMP- and cGMP-dependent protein kinase phosphorylation sites. 

amino acids 192-195, 225-228 

N-myristoylation sites. 

amino acids 42-47, 46-51, 136-141 
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FIGURE 13Q 



CCTGGAGCCGGAAGCGCGGCTGCAGCAGGGCGAGGCTCCAGGTGGGGTCGGTTCCGCATCCAGCC 
TAGCGTGTCCACGATGCGGCTGGGCTCCGGGACTTTCGCTACCTGTTGCGTAGCGATCGAGGTGC 
TAGGGATCGCGGTCTTCCTTCGGGGATTCTTCCCGGCTCCCGTTCGTTCCTCTGCCAGAGCGGAA 
CACGGAGCGGAGCCCCCAGCGCCCGAACCCTCGGCTGGAGCCAGTTCTAACTGGACCACGCTGCC 
ACCACCTCTCTTCAGTAAAGTTGTTATTGTTCTGATAGATGCCTTGAGAGATGATTTTGTGTTTG 
GGTCAAAGGGTGT GAAAT T TAT GC CC TACACAACT TACCT T GTGGAAAAAGGAGCAT CT CACAGT 
TTTGTGGCTGAAGCAAAGCCACCTACAGTTACTATGCCTCGAATCAAGGCATTGATGACGGGGAG 
CCTTCCTGGCTTTGTCGACGTCATCAGGAACCTCAATTCTCCTGCACTGCTGGAAGACAGTGTGA 
TAAGACAAGCAAAAGCAGCT GGAAAAAGAATAGT CT T T TAT GGAGATGAAACCTGGGTTAAAT TA 
T T CCCAAAGGAT T T TG TGGAATATGATGGAACAACC TCAT T T TTCG TGTCAGATTACACAGAGGT 
GGATAATAAT GT CACGAGGCAT T TGGATAAAGTATTAAAAAGAGGAGAT TGGGACATAT TAAT CC 
TCCACTACCTGGGGCTGGACCACATTGGCCACATTTCAGGGCCCAACAGCCCCCTGATTGGGCAG 
AAGCTGAGCGAGAT GGACAGCGT GC TGATGAAGATCCACACCTCAC TGCAGT CGAAGGAGAGAGA 
GACGCCTTTACCCAATTTGCTGGTTCTTTGTGGTGACCATGGCATGTCTGAAACAGGAAGTCACG 
r ? | GGGCC TCCT CCACCGAGGAGG TGAATACACC T CT GAT T T T AATCAGT TC TGCGTTTGAAAGGAAA 

Jjf C C C GG T GAT AT C C GACAT C CAAAGCAC G TC CA ATAGA C GGAT G T GGC T GCGAC AC T G G C GAT AGC 

W- AC T TGGC TTACCGAT T CCAAAAGACAGT GTAGGGAGCCT CC TATT CCCAGT T GTGGAAGGAAGAC 

ill CAATGAGAGAGCAGT TGAGAT TT TTACATTTGAATACAGTGCAGCTTAGTAAACTGTTGCAAGAG 

f|| AAT GT GCC G T CAT AT GAAAAAGAT C C T GGG T T T GAGC AG T T TAAAAT G T CAGAAAG AT T G CAT GG 

GAACT GGATCAGACT GT AC TT GGAGGAAAAGCAT T CAGAAGTCCTAT TCAACC T GGGCTCCAAGG 
TTCTCAGGCAGTACCTGGATGCTCTGAAGACGCTGAGCTTGTCCCTGAGTGCACAAGTGGCCCAG 
TTCTCACCCTGCTCCTGCTCAGCGTCCCACAGGCACTGCACAGAAAGGCTGAGCTGGAAGTCCCA 
CTGTCATCTCCTGGGTTTTCTCTGCTCTTTTATTTGGTGATCCTGGTTCTTTCGGCCGTTCACGT 
CATTGTGTGCACCTCAGCTGAAAGTTCGTGCTACTTCTGTGGCCTCTCGTGGCTGGCGGCAGGCT 
GCCTTTCGTTTACCAGACTCTGGTTGAACACCTGGTGTGTGCCAAGTGCTGGCAGTGCCCTGGAC 
AGGGGGCCTCAGGGAAGGACGTGGAGCAGCCTTATCCCAGGCCTCTGGGTGTCCCGACACAGGTG 
TTCACATCTGTGCTGTCAGGTCAGATGCCTCAGTTCTTGGAAAGCTAGGTTCCTGCGACTGTTAC 



CI 



111 CAAGGTGATTGTAAAGAGCTGGCGGTCACAGAGGAACAAGCCCCCCAGCTGAGGGGGTGTGTGAA 
TCGGACAGCC TCCCAGCAGAGGT GTGGGAGC T GCAGCT GAGGGAAGAAGAGACAATCGGCCT GGA 
CACTCAGGAGGGTCAAAAGGAGACTTGGTCGCACCACTCATCCTGCCACCCCCAGAATGCATCCT 
GCCTCATCAGGTCCAGATTTCTTTCCAAGGCGGACGTTTTCTGTTGGAATTCTTAGTCCTTGGCC 
TCGGACACCTTCATTCGTTAGCTGGGGAGTGGTGGTGAGGCAGTGAAGAAGAGGCGGATGGTCAC 
AC TCAGATCCACAGAGCCCAGGATCAAGGGACC CAC T GCAGT GGCAGCAGGAC TGT T GGGCCCCC 
ACCCCAACCCTGCACAGCCCTCATCCCCTCTTGGCTTGAGCCGTCAGAGGCCCTGTGCTGAGTGT 
CTGACCGAGACACTCACAGCTTTGTCATCAGGGCACAGGCTTCCTCGGAGCCAGGATGATCTGTG 
CCACGCTTGCACCTCGGGCCCATCTGGGCTCATGCTCTCTCTCCTGCTATTGAATTAGTACCTAG 
C T GCACACAGTAT GTAGT TACCAAAAGAATAAAC GGCAAT AAT TGAGAAAAAAAA 
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FIGURE ldO 

MRLGSGTFATCCVAIEVLGIAVFLRGFFPAPVRSSARAEHGAEPPAPEPSAGASSNWTTLPPPLF 
SKWIVLIDALRDDFVFGSKGVKFMPYTTYLVEKGASHSFVAEAKPPTVTMPRIKALMTGSLPGF 
VDVIRNLNSPALLEDSVIRQAKAAGKRIVFYGDETWVKLFPKHFVEYDGTTSFFVSDYTEVDNNV 
TRHLDKVXKRGDWDILILHYLGLDHIGHISGPNSPLIGQKLSEMDSVIiMKIHTSLQSKERETPLP 
NLLVLCGDHGMSETGSHGASSTEEVNTPLILISSAFERKPGDIRHPKHVQ 

Important features of the protein: 
Signal peptide: 

amino acids 1-34 

Transmembrane domain: 

amino acids 58-7 6 

N-glycosylation sites . 

amino acids 56-60, 194-198 

N-myristoylation sites. 

amino acids 6-12, 52-58, 100-106, 125-131, 233-239, 270-276, 
275-281, 278-284 

Amidation site. 

amino acids 154-158 

Cell attachment sequence. 

amino acids 205-208 
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FIGURE 141 

GGCACGAGGCAAGCCTTCCAGGTTATCGTGACGCACCTTGAAAGTCTGAGAGCTACTGCCCTACA 
GAAAGTTACTAGTGCCCTAAAGCTGGCGCTGGCACTGATGTTACTGCTGCTGTTGGAGTACAACT 
TCCCTATAGAAAACAACT GCCAGCACCTTAAGACCACTCACACCT TCAGAGTGAAGAAC TTAAAC 
CCGAAGAAATTCAGCATT CATGACCAGGATCACAAAGTAC T GGTCCT GGAC TC T GGGAAT CT CAT 
AGCAGTTCCAGATAAAAACTACATACGCCCAGAGATCTTCTTTGCATTAGCCTCATCCTTGAGCT 
CAGCCTCTGCGGAGAAAGGAAGTCCGATTCTCCTGGGGGTCTCTAAAGGGGAGTTTTGTCTCTAC 
TGTGACAAGGATAAAGGACAAAGTCATCCATCCCT T CAGCT GAAGAAGGAGAAAC TGAT GAAGCT 
GGCTGCCCAAAAGGAATCAGCACGCCGGCCCTTCATCTTTTATAGGGCTCAGGTGGGCTCCTGGA 
ACATGCTGGAGTCGGCGGCTCACCCCGGATGGTTCATCTGCACCTCCTGCAATTGTAATGAGCCT 
GT TGGGGTGACAGATAAAT T TGAGAACAGGAAACACATT GAAT TT TCAT TT CAACCAGT T T GCAA 
AGCTGAAATGAGCCCCAGTGAGGTCAGCGAT TAGG AAACTGCCCCATTGAACGCCTTCCTCGCTA 
AT T T GAAC TAAT T GTATAAAAACACCAAACC TGC T CACT 
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FIGURE 142 

MLLLLLEYNFPIENNCQHLKTTHTFRVKNLNPKKFSIHDQDHKVLVLDSGNLIAVPDKNYIRPEI 
FFALAS SLS SASAEKGSPI LLGVSKGE FCLYCDKDKGQSHPS LQLKKEKLMKLAAQKE SARRP FI 
FYRAQVG S WNMLE S AAH PG W F I C T S CNCNE P VGVT DKFENRKH I E FS FQPVCKAEMS PSE VS D 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 33-36 

N-myristoylation site. 

amino acids 50-55, 87-92 

Interleukin-1 

amino acids 37-182 
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FIGURE 143 

CTAGAGAGTATAGGGCAGAAGGATGGCAGATGAGTGACTCCACATCCAGAGCTGCCTCCCTTTAA 
TCCAGGATCCTGTCCTTCCTGTCCTGTAGGAGTGCCTGTTGCCAGTGTGGGGTGAGACAAGTTTG 
TCCCACAGGGCTGTCTGAGCAGATAAGATTAAGGGCTGGGTCTGTGCTCAATTAACTCCTGTGGG 
CACGGGGGCTGGGAAGAGCAAAGTCAGCGGTGCCTACAGTCAGCACCATGCTGGGCCTGCCGTGG 
AAGGGAGGTCTGTCCTGGGCGCTGCTGCTGCTTCTCTTAGGCTCCCAGATCCTGCTGATCTATGC 
CTGGCATTTCCACGAGCAAAGGGACTGTGATGAACACAATGTCATGGCTCGTTACCTCCCTGCCA 
CAGTGGAGTTTGCTGTCCACACATTCAACCAA.CAGAGCAAGGACTACTATGCCTACAGACTGGGG 
CACAT C TTGAAT T CCT GGAAGGAGCAGG TGGAGTCCAAGAC TGTATTC TCAATGGAGCTACT GCT 
GGGGAGAACTAGGTGTGGGAAATTTGAAGACGACATTGACAACTGCCATTTCCAAGAAAGCACAG 
AGCTGAACAATACTTTCACCTGCTTCTTCACCATCAGCACCAGGCCCTGGATGACTCAGTTCAGC 
CTCCTGAACAAGACCTGCTT GGAGGGATTCCACTGAGTGAAACCCACTCACAGGCT TGT CCATGT 
GCTGCTCCCACATTCCGTGGACATCAGCACTACTCTCCTGAGGACTCTTCAGTGGCTGAGCAGCT 
TTGGACTTGTTTGTTATCCTATTTTGCATGTGTTTGAGATCTCAGATCAGTGTTTTAGAAAATCC 
ACACATCTTGAGCCTAATCATGTAGTGTAGATCAT TAAACATCAGCATTT TAAGAAAAAAAAAAA 
AAAAAkAAAAAAAAAAAkAAAAAAAAAAAAA^ 
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FIGURE ldd 

MLGLPWKGGLSWALLLLLLGSQILLIYAWHFHEQRDCDEHNVMARYLPATVEFAVHTFNQQSKDY 
YAYRLGHILNSWKEQVESKTVFSMELLLGRTRCGKFEDDIDNCHFQESTELNNTFTCFFTISTRP 
WMTQFSLLNKTCLEGFH 

Important features of the protein: 
Signal peptide: 

amino acids 1-25 

N-glycosylation sites. 

amino acids 117-121, 139-143 

N-myristoylation site, 

amino acids 9-15 
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FIGURE 145 



C TGTGCAGC T CGAGGCT CCAGAGGCACACTCCAGAGAGAGCCAAGGT TC TGACGCG ATGA GGAAG 
CACCTGAGCTGGTGGTGGCTGGCCACTGTCTGCATGCTGCTCTTCAGCCACCTCTCTGCGGTCCA 
GACGAGGGGCAT CAAGCACAGAATCAAGTGGAACCGGAAGGC CC T GCCCAGCACT GCCCAGAT CA 
CTGAGGCCCAGGTGGCTGAGAACCGCCCGGGAGCCTTCATCAAGCAAGGCCGCAAGCTCGACATT 
GACTTCGGAGCCGAGGGCAACAGGTACTACGAGGCCAACTACTGGCAGTTCCCCGATGGCATCCA 
CTACAACGGCTGCTCTGAGGCTAATGTGACCAAGGAGGCATTTGTCACCGGCTGCATCAATGCCA 
CCCAGGCGGCGAACCAGGGGGAGT TCCAGAAGCCAGACAACAAGCTCCACCAGCAGGT GCTCT GG 
CGGCTGGTCCAGGAGCTCTGCTCCCTCAAGCATTGCGAGTTTTGGTTGGAGAGGGGCGCAGGACT 
TCGGGTCACCATGCACCAGCCAGTGCTCCTCTGCCTTCTGGCTTTGATCTGGCTCATGGTGAAAT 
AAGCTTGCCAGGAGGCTGGCAGTACAGAGCGCAGCAGCGAGCAAATCCTGGCAAGTGACCCAGCT 



CTTCTCCCCCAAACCCACGCGTGTTCTGAAGGTGCCCAGGAGCGGCGATGCACTCGCACTGCAAA 

in 

TGCCGCTCCCACGTATGCGCCCTGGTATGTGCCTGCGTTCTGATAGATGGGGGACTGTGGCTTCT 
III CCGTCAC TCCAT T CT CAGCCCCTAGCAGAGCGT CT GGCACACTAGAT TAGTAGTAAATGC T TGAT 

$ GAGAAGAACACAT CAGGCAC T GCGCCACCTGC T TCACAGTAC TTCCCAACAAC TC T T AGAGGT AG 

CI GT GTATT CC CGTT T T ACAGATAAGGAAACTGAGGC CCAGAGAGC TGAAGTAC TGCACCCAGCATC 

m 

tf* ACCAGCTAGAAAGTGGCAGAGCCAGGATTCAACCCTGGCTTGTCTAACCCCAGGTTTTCTGCTCT 

H GTCCAATTCCAGAGCTGTCTGGTGATCACTTTATGTCTCACAGGGACCCACATCCAAACATGTAT 

III 

0% CTC TAATGAAAT TGTGAAAGCTCCATGT TTAGAAATAAATGAAAACACCTGA 
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FIGURE 146 

MRKHLSWWW^TVCMLLFSHLSAVQTRGIKHRIKWNRKALPSTAQITEAQVAENRPGAFIKQGRK 
LDIDFGAEGNRYYEANYWQFPDGIHYNGCSEANVTKEAFVTGCINATQAANQGEFQKPDNKLHQQ 
VLWRLVQELCSLKHCEFWLERGAGLRVTMHQPVLLCLLALIWLMVK 

Important features of the protein: 
Signal peptide: 

amino acids 1-2 6 

3 : 

=8:3 ~ 

f*\ Transmembrane domain: 

HI amino acids 157-171 

^| N-glycosylation sites. 

2t amino acids 98-102, 110-114 

W 

Q Tyrosine kinase phosphorylation site. 

\M amino acids 7 6-83 

^ N-myristoylation sites. 

U 

m. amino acids 71-77, 88-94, 93-99, 107-113, 154-160 

Amidation site. 

amino acids 62-66 
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FIGURE 147 

GCCTTGGCCTCCCAAAGGGCTGGGATTATAGGCGTGACCACCATGTCTGGTCCAGAGTCTCATTT 
CCTGATGATTTATAGACTCAAAGAAAACTC ATG TTCAGAAGCTCTCTTCTCTTCTGGCCTCCTCT 
CTGTCTTCTTTCCCTCTTTCTTCT TAT T T TAAT TAGTAGCATC TAC TCAGAG TCAT GCAAGCTGG 
AAATCTTTCATTTTGCTTGTCAGTGGGGTAGGTCACTGAGTCTTAGTTTTTATTTTTTGAAATTT 
CAAC T T TCAGAT T CAGGGGGTACATGTGAAGGT T T GT T T TAT GAG TAT AT T GCATGAT GCTGAGG 
TTTGGGGT 
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FIGURE 148 

MFRSSLLFWPPLCLLSLFLLILISSIYSESCKLEIFHFACQWGRSLSLSFYFLKFQLSDSGGTCE 
GLFYEYIA 

Important features of the protein: 
Signal peptide: 

amino acids 1-25 

N-myristoylation site. 

amino acids 62-68 
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FIGURE ldQ 

GTCTCCGCGTCACAGGAACTTCAGCACCCACAGGGCGGACAGCGCTCCCCTCTACCTGGAGACTTGAC 
TCCCGCGCGCCCCAACCCTGCTTATCCCTTGACCGTCGAGTGTCAGAGATGCTGCAGCCGCCCAGTCC 
CGGCCCCTCTCCCGCCCCACACCCACCCTCCTGGCTCTTCCTGTTTTTACTCCTCCTTTTCATTCATA 
ACAAAAGC T ACAG C T C CAGGAGCC C AGC GCC GGGC T GT GAC CCAAGC C GAG C GT G GAAGAATGG GGT T 
CCTCGGGACCGGCACTTGGATTCTGGTGTTAGTGCTCCCGATTCAAGCTTTCCCCAAACCTGGAGGAA 
. GCCAAGACAAATC TCTACATAATAGAGAATTAAGT GCAGAAAGACCT T TGAATGAACAGAT TGCTGAA 
GCAGAAGAAGACAAGATTAAAAAAACAT ATCCTCCAGAAAACAAGCCAGGT CAGAGCAACTAT T CTT T 
h'~ TGTT GATAACTT GAACCTGCTAAAGGCAATAACAGAAAAGGAAAAAATT GAGAAAGAAAGACAATCTA 

;;*f T AAGAAGC T CC C CAC T T GATAAT AAGT T GAAT GT G GAAGAT GTT GAT T C AAC CAAGAAT CGAAAACT G 

JjJ ATCGAT GATTAT GACT CTACTAAGAGT GGAT T GGATCATAAATTT CAAGATGATCCAGATGGTCTTCA 

i[] TCAACTAGACGGGACT CCT TT AACCGCT GAAGACAT T GT CCATAAAAT CGCT GCCAGGAT T TAT GAAG 

ill AAAATGACAGAGCCGTGTTTGACAAGATTGTTTCTAAACTACTTAATCTCGGCCTTAT^ 
||'| CAAGCACATACACTGGAAGATGAAGTAGCAGAGGTTTTACAAAAATTAATCTCAAAGGAAGCCAACAA 
ill T TAT GAG GAGGAT CC CAAT AAGC CCACAAGC T G GAC T GAGAAT CAGG C T GGAAAAATACC AGAGAAAG 

¥ TGACT CCAATGGCAGCAAT T CAAGATGGT CT TGCTAAGGGAGAAAACGATGAAACAGTATCTAACACA 

M TTAACCTT GACAAAT GGCT TGGAAAGGAGAACTAAAACCTACAGT GAAGACAACT TTGAGGAACTCCA 

III 

^ ATATTTCCCAAATTTCTATGCGCTACTGAAAAGTATTGATTCAGAAAAAGAAGCAAAAGAGAAAGAAA 
j*j CACTGATTACTAT CATGAAAACACTGATTGACTTTGTGAAGATGATGGT GAAATATGGAACAATATCT 

g| CCAGAAGAAGGT GTTT CCTACCT TGAAAACT TGGAT GAAATGATT GCT CTT CAGACCAAAAACAAGCT 

|1| AGAAAAAAATGCTACT GACAATATAAGCAAGCTTTTCCCAGCACCATCAGAGAAGAGTCAT GAAGAAA 

CAGACAGT ACC AAGGAAGAAGCAGC TAAGAT GGAAAAGGAATAT G GAAG CT T GAAGGAT T C CACAAAA 
GATGATAACTCCAACCCAGGAGGAAAGACAGATGAACCCAAAGGAAAAACAGAAGCCTATTTGGAAGC 
CATCAGAAAAAATATTGAATGGTTGAAGAAACATGACAAAAAGGGAAATAAAGAAGATTATGACCTTT 
CAAAGAT GAGAGACTT CAT CAATAAACAAGCT GAT GCTTATGTGGAGAAAGGCAT CCTTGACAAGGAA 
GAAGCCGAGGCCATCAAGC GCATTTATAGCAGCCT G TAAA AAT GGCAAAAGATCCAGGAGT CTT T CAA 
CT GT T T C AGAAAACAT AAT ATAG CT TAAAAC AC T T C T AAT T CT G T GAT TAAAATT T T T T GAC C C AAG G 
GTTATTAGAAAGT GCT GAAT T T ACAGTAGTT AACC T T T T ACAAGT GGT TAAAACATAGCTT TCT TCCC 
GTAAAAACTATCTGAAAGTAAAGTTGTATGTAAGCTGAAAAAAAAAAAAAAAAAAAA 
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FIGURE ISO 

MGFLGTGTWILVLVLPIQAFPKPGGSQDKSLHNRELSAERPLNEQIAEAEEDKIKKTYPPENKPG 
QSNYSFVDNLNLLKAITEKEKIEKERQSIRSSPLDNKLNVEDVDSTKNRKLIDDYDSTKSGLDHK 
FQDDPDGLHQLDGTPLTAEDIVHKIAARIYEENDRAVFDKIVSKLLNLGLITESQAHTLEDEVAE 
VLQKLISKEAI^NYEEDPNKPTSWTENQAGKIPEKVTPMAAIQDGLAKGENDETVSNTLTLTNGLE 
RRTKTYSEDNFEELQYFPNFYALLKS IDSEKEAKEKETLI T IMKTL IDFVKMMVKYGT I SPEEGV 
SYLENLDEMIALQTKNKLEKNATDNISKLFPAPSEKSHEETDSTKEEAAKMEKEYGSLKDSTKDD 
NSNPGGKTDEPKGKTEAYLEAI RKNIEVJLKKHDKKGNKEDYDLSKMRDFINKQADAYVEKGILDK 
EEAEAIKRIYSSL 

N-glycosylation sites: 

amino acids 68-71, 346-349, 350-353 

Casein kinase II phosphorylation site: 

amino acids 70-73, 82-85, 97-100, 125-128, 147-150, 188-191, 217- 

220, 265-268, 289-292, 305-308, 320-323, 326-329, 362-365, 368- 
341, 369-372, 382-385, 386-389, 387-390 

N-myristoylation sites: 

amino acids 143-148, 239-244 
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FIGURE 151 



CI 
m 



CGGCTCGAGGCTCCCGCCAGGAGAAAGGAACATTCTGAGGGGAGTCTACACCCTGTGGAGCTCAA 
G ATG GTCCTGAGTGGGGCGCTGTGCTTCCGAATGAAGGACTCGGCATTGAAGGTGCTTTATCTGC 
ATAATAACCAGC T T CTAGC T GGAGGGC TGCATGCAGGGAAGGTCAT TAAAGGTGAAGAGATCAGC 
GTGGTCCCCAATCGGTGGCTGGATGCCAGCCTGTCCCCCGTCATCCTGGGTGTCCAGGGTGGAAG 
CCAGT GCC T G T CATGT GGGGTGGGGCAGGAGCCGACT C TAACACTAGAGCCAGT GAACAT CAT GG 
AGCTCTATCTTGGTGCCAAGGAATCCAAGAGCTTCACCTTCTACCGGCGGGACATGGGGCTCACC 
TCCAGCTTCGAGTCGGCTGCCTACCCGGGCTGGTTCCTGTGCACGGTGCCTGAAGCCGATCAGCC 
T G TCAGAC TCACCCAGC TT CC GGAGAATGGT GGC T GGAAT GCCCCCAT CACAGAC T T CT ACT T CC 
AGCAGTGTGACTAGGGCAACGTGCCCCCCAGAACTCCCTGGGCAGAGCCAGCTCGGGTGAGGGGT 
GAGTGGAGGAGACCCATGGCGGACAATCACTCTCTCTGCTCTCAGGACCCCCACGTCTGACTTAG 
TGGGCACCTGACCACTTTGTCTTCTGGTTCCCAGTTTGGATAAATTCTGAGATTTGGAGCTCAGT 
CCACGGTCCTCCCCCACTGGATGGTGCTACTGCTGTGGAACCTTGTAAAAACCATGTGGGGTAAA 
CTGGGAATAACATGAAAAGATTTCTGTGGGGGTGGGGTGGGGGAGTGGTGGGAATCATTCCTGCT 
TAATGGTAACTGACAAGTGTTACCCTGAGCCCCGCAGGCCAACCCATCCCCAGTTGAGCCTTATA 
GGGTCAGTAGCTC TCCACAT GAAGTCC T GT CACT CACCACTGTGCAGGAGAGGGAGGT GGTCATA 
III GAGTCAGGGATCTATGGCCCTTGGCCCAGCCCCACCCCCTTCCCTTTAATCCTGCCACTGTCATA 
||l TGCTACC T T TCCT AT C T CT T CCCT CATCAT C T T GT TGT GGGCAT GAGGAGGT GGTGAT G T CAGAA 

Tfi GAAAT G G C T C GAG C T C AG AAG A T AAAAG AT AAG TAG G G T AT G C T GAT C C T C T T T T AAAAAC C C AA 

2t GATACAATCAAAATCCCAGATGC T GGTCTC TAT T CCCAT GAAAAAGTGC TCATGACAT AT TGAGA 

5 *- 3 AGACCTACTTACAAAGTGGCATATATTGCAATTTATT TTAATTAAAAGATACCTAT TTATATATT 

* TCTTTATAGAAAAAAGTCTGGAAGAGTTTACTTCAATTGTAGCAATGTCAGGGTGGTGGCAGTAT 
Q AGGTGATTTTTCTTTTAATTCTGTTAATTTATCTGTATTTCCTAATTTTTCTACAATGAAGATGA 
m AT TCC T TGTATAAAAATAAGAAAAGAAAT TAAT C T TGAGGTAAGCAGAGCAGACATCAT C TC T GA 

S TTGTCCTCAGCCTCCACTTCCCCAGAGTAAATTCAAATTGAATCGAGCTCTGCTGCTCTGGTTGG 
**j T TGTAGTAGT GATCAGGAAACAGATC TCAGCAAAGCCACTGAGGAGGAGGC TGTGC T GAGT T TGT 

111 GTGGCTGGAATCTCTGGGTAAGGAACTTAAAGAACAAAAATCATCTGGTAATTCTTTCCTAGAAG 
Q GATCACAGCCCCTGGGATTCCAAGGCATTGGATCCAGTCTCTAAGAAGGCTGCTGTACTGGTTGA 
?|j ATTGTGTCCCCCTCAAATTCACATCCTTCTTGGAATCTCAGTCTGTGAGTTTATTTGGAGATAAG 
Vf ' GTCTCTGCAGATGTAGT TAGT TAAGACAAGGTCAT GCTGGATGAAGGTAGACCTAAATTCAATAT 

GACTGGTTTCCTTGTATGAAAAGGAGAGGACACAGAGACAGAGGAGACGCGGGGAAGACTATGTA 
AAGATGAAGGCAGAGAT CGGAGT T T TGCAGCCACAAGC TAAGAAACACCAAGGAT T GT GGCAACC 
ATCAGAAGCTTGGAAGAGGCAAAGAAGAATTCTTCCCTAGAGGCTTTAGAGGGAT7VACGGCTCTG 
C T GAAACCT TAAT CTCAGAC T TCCAGC C TC C T GAACGAAGAAAGAATAAAT T TCGGCT GT TT TAA 
GCCACCAAGGATAAT TGGT TACAGCAGC TC TAGGAAAC TAATACAGCTGCTAAAAT GATCCC TGT 
CTCCTCGTGTTTACATTCTGTGTGTGTCCCCTCCCACAATGTACCAAAGTTGTCTTTGTGACCAA 
TAGAATATGGCAGAAGT GATGGCATGCCACT TCCAAGATTAGGT TATAAAAGACACTGCAGCTTC 
TACTTGAGCCCTCTCTCTCTGCCACCCACCGCCCCCAATCTATCTTGGCTCACTCGCTCTGGGGG 
AAGCTAGC T GCCAT GCTAT GAGCAGGCC TATAAAGAGACT TACG TGGTAAAAAATGAAGT CTCCT 
GC CCACAGCCACAT TAGTGAACCTAGAAGCAGAGACTC T GT GAGATAAT CGAT GTTTGTTGTTTT 
AAG T TGC TCAGT T T T GGT CT AAC T T GT TAT GCAGCAAT AGAT AAATAATAT GCAGAGAAAGAG 
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FIGURE 1S2 

MVLSGALCFRMKDSALKVLYLHNNQLLAGGLHAGKVIKGEEISWPNRWLDASLSPVILGVQGGS 
QCLSCGVGQEPTLTLEPVNIMELYLGAKESKSFTFYRRDMGLTSSFESAAYPGWFLCTVPEADQP 
VRLTQLPENGGWNAPITDFYFQQCD 

N-myristoylation sites ♦ 

amino acids 29-34, 30-35, 60-65, 63-68, 73-78, 91-96, 106-111 

Interleukin-1 signature. 

amino acids 111-131 

Interleukin-1 proteins. 

amino acids 8-29, 83-120, 95-134, 64-103 
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FIGURE isa 

CTTCAGAACAGGTTCTCCTTCCCCAGTCACCAGTTGCTCGAGTTAGAATTGTCTGCAATGGCCGC 
CCTGCAGAAATCTGTGAGCTCTTTCCTTATGGGGACCCTGGCCACCAGCTGCCTCCTTGTCTTGG 
CCCTCTTGGTACAGGGAGGAGCAGCTGCGCCCATCAGCTCCCACTGCAGGCTTGACAAGTCCAAC 
TTCCAGCAGCCCTATATCACCAACCGCACCTTCATGCTGGCTAAGGAGGCTAGCTTGGCTGATAA 
CAACACAGACGTTCGTCTCATTGGGGAGAAACTGTTCCACGGAGTCAGTATGAGTGAGCGCTGCT 
ATCTGATGAAGCAGGTGCTGAACTTCACCCTTGAAGAAGTGCTGTTCCCTCAATCTGATAGGTTC 
CAGCCTTATATGCAGGAGGTGGTGCCCTTCCTGGCCAGGCTCAGCAACAGGCTAAGCACATGTCA 
TATTGAAGGTGAT GACCTGCATATCCAGAGGAAT GT GCAAAAGCTGAAGGACACAGTGAAAAAGC 
T T GGAGAGAG T GGAGAGAT CAAAGC AATTGGAGAACT GGAT T TGC TGT TTATGTCT CTGAGAAAT 
GCCTGCATTTGACCAGAGCAAAGCTGAAAAATGAATAACTAACCCCCTTTCCCTGCTAGAAATAA 
CAATTAGATGCCCCAAAGCGAT t T TT T T TAACCAAAAGGAAGAT GGGAAGCCAAAC TCCATCATG 
ATGGGTGGAT TC CAAAT GAACCCC TGCGTTAGT TACAAAGGAAACCAAT GCCACT T TTGT T TATA 
AGACCAGAAGGTAGACT TTCTAAGCATAGATAT TTAT TGATAACATTTCAT TGTAACTGGTGT TC 
TATACACAGAAAACAAT TTATTT TTTAAATAAT TGTCTTTTTCCATAAAAAAGATTACT T TCCAT 
TCCT T TAGGGGAAAAAACCCCTAAATAGCT TCAT GTT T CCATAATCAGTAC T T TAT ATT TAT AAA 
TGTAT T TAT TATTAT TATAAGAC TGCAT T T TAT T TATATCAT T T TAT TAATATGGAT T TAT TTAT 
AGAAACATCATTCGATATT GCTAC T TGAGTGTAAGGC TAAT AT TGATATT TATGACAAT AAT TAT 
AGAGC TATAACAT GTT TAT T T GACC TCAATAAACACT T GGATATCCC 
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FIGURE ISA 

MAALQKS VS S FLMGTLATSCLLLLALLVQGGAAAPI S SHCRLDKSNFQQP Y I TNRT FMLAKEASL 
ADNNTDVRLIGEKLFHGVSMSERCYLMKQVLNFTLEEVLFPQSDRFQPYMQEWPFLARLSNRLS 
TCHIEGDDLHIQRNVQKLKDTVKKLGESGEIKAIGELDLLFMSLRNACI 

Important features of the protein: 
Signal peptide: 

amino acids 1-33 

N-glycosylation sites. 

amino acids 54-58, 68-72, 97-101 ■ 

N-myristoylation sites. 

amino acids 14-20, 82-88 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 10-21 
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FIGURE 155 

GGCTTGCTGAAAATAAAATCAGGACTCCTAACCTGCTCCAGTCAGCCTGCTTCCACGAGGCCTGT 
CAGTCAGTGCCCGACTTGTGACTGAGTGTGCAGTGCCCAGCATGTACCAGGTCAGTGCAGAGGGC 
TGCCTGAGGGCTGTGCTGAGAGGGAGAGGAGCAGAGATGCTGCTGAGGGTGGAGGGAGGCCAAGC 
TGCCAGGTTTGGGGCTGGGGGCCAAGTGGAGTGAGAAACTGGGATCCCAGGGGGAGGGTGCAGAT 
GAGGGAGCGACCCAGATTAGGTGAGGACAGTTCTCTCATTAGCCTTTTCCTACAGGTGGTTGCAT 

tcttggcaatggtcatgggaacccacacctacagccactggcccagctgctgccccagcaaaggg 
caggacacctctgaggagctgctgaggtggagcactgtgcctgtgcctcccctagagcctgctag 
gcccaaccgccacccagagtcctgtagggccagtgaagatggacccctcaacagcagggccatct 
c cccc t ggagatat gagtt ggacagagac t tgaaccggctcccccaggacctgtaccacgcccgt 
tgcctgtgcccgcactgcgtcagcctacagacaggctcccacatggacccccggggcaactcgga 
gctgctctaccacaaccagactgtcttctacaggcggccatgccatggcgagaagggcacccaca 
agggctactgcctggagcgcaggctgtaccgtgtttccttagcttgtgtgtgtgtgcggccccgt 
gtgatgggc tagc cggacctgctggaggctggtccctttttgg'gaaacctggagccaggtgtaca 
accacttgccatgaagggccaggatgcccagatgcttggcccctgtgaagtgctgtctggagcag 
caggatcccgggacaggatggggggct t tggggaaaacc tgcac t t ct gcacat tt t gaaaagag 

CAGCTGCTGCTTAGGGCCGCCGGAAGCTGGTGTCCTGTCATTTTCTCTCAGGAAAGGTTTTCAAA 
GTTCTGCCCATTTCTGGAGGCCACCACTCCTGTCTCTTCCTCTTTTCCCATCCCCTGCTACCCTG 
GCCCAGCACAGGCACTTTCTAGATATTTCCCCCTTGCTGGAGAAGAAAGAGCCCCTGGTTTTATT 
TGTTTGTTTACTCATCACTCAGTGAGCATCTACTTTGGGTGCATTCTAGTGTAGTTACTAGTCTT 
T T GACAT GGATGAT T CT GAGGAGGAAGCTGT TAT T GAAT GT ATAGAGAT T TAT CCAAATAAATAT 
CTTTATTTAAAAATGAAAAA 
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FIGURE 1*6 

MRERPRLGEDSSLISLFLQWAFLAMVMGTHTYSHWPSCCPSKGQDTSEELLRWSTVPVPPLEPA 
RPNRHPESCRASEDGPLNSRAISPWRYELDRDLNRLPQDLYHARCLCPHCVSLQTGSHMDPRGNS 
ELLYHNQTVFYRRPCHGEKGTHKGYCLERRLYRVSLACYCVRPRVMG 

Important features of the protein: 
Signal peptide: 

amino acids 1-32 

N-glycosylation site, 

amino acids 136-140 

Tyrosine kinase phosphorylation site. 

amino acids 127-135 

N-myristoylation sites. 

amino acids 44-50, 150-156 
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FIGURE 157 

CCGGCG ATG TCGCTCGTGCTGCTAAGCCTGGCCGCGCTGTGCAGGAGCGCCGTACCCCGAGAGCC 
GACCGTTCAATGTGGCTCTGAAACTGGGCCATCTCCAGAGTGGATGCTACAACATGATCTAATCC 
CCGGAGACT TGAGGGACCTCCGAGTAGAACCTGT TACAACTAGTGTTGCAACAGGGGACTAT TCA 
ATTTTGATGAATGTAAGCTGGGTACTCCGGGCAGATGCCAGCATCCGCTTGTTGAAGGCCACCAA 
GATTTGTGTGACGGGCAAAAGCAACTTCCAGTCCTACAGCTGTGTGAGGTGCAATTACACAGAGG 
CCTTCCAGACTCAGACCAGACCCTCTGGTGGTAAATGGACATTTTCCTACATCGGCTTCCCTGTA 
GAGCT GAACACAG TCTATT T CAT TGGGGCCCATAAT AT TCC TAATGCAAATATGAATGAAGAT GG 
CCCTTCCATGTCTGTGAATTTCACCTCACCAGGCTGCCTAGACCACATAATGAAATATAAAAAAA 
AGTGTGTCAAGGCCGGAAGCCTGTGGGATCCGAACATCACTGCTTGTAAGAAGAATGAGGAGACA 
GTAGAAGT GAACT TCACAACCAC TCCCC TGGGAAACAGATACAT GGCT C T TATCCAACACAGCAC 
TATCATCGGGTTT TCT CAGGTGTT TGAGCCACACCAGAAGAAACAAACGCGAGCTTCAGTGGTGA 
TTCCAGTGACTGGGGATAGTGAAGGTGCTACGGTGCAGCTGACTCCATATTTTCCTACTTGTGGC 
AGCGACTGCATCCGACATAAAGGAACAGTTGTGCTCTGCCCACAAACAGGCGTCCCTTTCCCTCT 
GGATAACAACAAAAGCAAGCCGGGAGGCTGGCTGCCTCTCCTCCTGCTGTCTCTGCTGGTGGCCA 
CATGGGTGCTGGTGGCAGGGATCTATCTAATGTGGAGGCACGAAAGGATCAAGAAGACTTCCTTT 
TCTACCACCACACTACTGCCCCCCATTAAGGTTCTTGTGGTTTACCCATCTGAAATATGTTTCCA 
TCACACAAT T TGT TAC T TCACTGAATT T CT TCAAAACCATT GCAGAAGTGAGGTCATCCTTGAAA 
AG T GGC AGAAAAAGAAAAT AG C AGAGAT GG G T C C AG T GC AG T GG C T T GC C AC T CAAAAGAAG G CA 
GCAGACAAAGTCGTCTTCCTTCTTTCCAATGACGTCAACAGTGTGTGCGATGGTACCTGTGGCAA 
GAGCGAGGGCAGTCCCAGTGAGAACTCTCAAGACCTCTTCCCCCTTGCCTTTAACCTTTTCTGCA 
GTGATC TAAGAAGCCAGATT CATCTGCACAAATACGTGGTGGTCTACT TTAGAGAGATTGATACA 
AAAGACGATTACAATGCTCTCAGTGTCTGCCCCT^AGTACCACCTCATGAAGGATGCCACTGCTTT 
CT GTGCAGAACT T C TCCATGTCAAGCAGCAGGT GT CAGCAGGAAAAAGAT CACAAGCCTGCCACG 
ATGGCTGCTGCTCCTTG TAG 
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FIGURE 158 

MSLVLLSLAALCRSAVPREPTVQCGSETGPSPEWMLQHDLIPGDLRDLRVEPVTTSVATGDYSIIi^VSWV 
LRADASIRLLKATKICVTGKSNFQSYSCVRCNYTEAFQTQTRPSGGKWTFSYIGFPVELNTVYFIGAHNIP 
NAlSnyiNEDGPSMSVNFTSPGCLDHIMKYKKKCVKAGSLWDPNITACKKNEETVEVNFTTTPLGNRYMALIQ^ 
STIIGFSQVFEPHQKKQTRASWIPVTGDSEGATVQLTPY FPTCGSDCIRHKGTWLCPQTGVPFPLDNNK 
SKPGGWLPLLLLSLLVATWLVAGIYLMWRHERIKKTSFSTTTLLPPIKVLVVYPSEICFHHTICYFTEFL 
QN HC RS E V I L E KWQ KKK I AEMG P VQWLATQ KKAADKWFLLS ND VN S VCDGT CGKSEGSPSENSQDLFPLA 
FNLFCSDLRSQIHLHKYWVYFREIDTKDDYNALSVCPKYHLMKDATAFCAELLHVKQQVSAGKRSQACHD 
GCCSL 

Important features of the protein: 
Signal peptide: 

amino acids 1-14 

Transmembrane domain: 

amino acids 290-309 

N-glycosylation sites. 

amino acids 67 - 71, 103 - 107, 156 - 160, 183 - 187, 197 - 201 and 283 
- 287 

cAMP- and cGMP-dependent protein kinase phosphorylation sites. 

amino. acids 228 - 232 and 319 - 323 

Casein kinase II phosphorylation sites. 

amino acids 178 - 182, 402 - 406, 414 - 418 and 453 - 457 

N-myristoylation site. 

amino acids .116-122 

Amidation site. 

amino acids 488-452 
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FIGURE 15Q 



AGCCACCAGCGCAACATGACAGT GAAGACCC T GCATGGCCCAGCCATGG TCAAG TACT TGC T GC T 
GTCGATATTGGGGCTTGCCTTTCTGAGTGAGGCGGCAGCTCGGAAAATCCCCAAAGTAGGACATA 
CTTTTTTCCAAAAGCCTGAGAGTTGCCCGCCTGTGCCAGGAGGTAGTATGAAGCTTGACATTGGC 
ATCATCAATGAAAACCAGCGCGTTTCCATGTCACGTAACATCGAGAGCCGCTCCACCTCCCCCTG 
GAATTACACTGTCACTTGGGACCCCAACCGGTACCCCTCGGAAGTTGTACAGGCCCAGTGTAGGA 
AC T T GGGC T GCAT C AAT G C T C AAGGAAAGGAAGACAT C T C CAT GAAT TCCGTTCC CAT C CAG CAA 
GAGACCCTGGTCGTCCGGAGGAAGCACCAAGGCTGCTCTGTTTCTTTCCAGTTGGAGAAGGTGCT 

H GGTGACTGTTGGCTGCACCTGCGTCACCCCTGTCATCCACCATGTGCAG TAA GAGGTGCATATCC 

1% ACTCAGCTGAAGAAG 



4 
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FIGURE 160 



MTVKTLHGPAMVKYLLLSILGLAFLSEIAAAilKIPKVGHTFFQKPESCPPVPGGSMKLDIGIINEN 
QRVSMSRNIESRSTSPWNYTVTWDPNRYPSEWQAQCRNLGCINAQGKEDISMNSVPIQQETLW 
RRKHQGCSVSFQLEKVLVTVGCTCVTPVIHHVQ 

Signal sequence: 

amino acids 1-30 



N-glycosylation site. 

amino acids 83-87 



© 
p 
m 

ill N-myristoylation sites. 

tf= amino acids 106-111, 136-141 

11 

C3 
III 
0 
W 
0 

rii 
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FIGURE 161 



ACACTGGCCAAACAAAAACGAAAGCACTCCGTGGTGGAAGTAGGAGGAGAGTCAGGACTCCCAGG 
ACAGAGAGT GCACAAAC TAC CCAGCACAGCC CCC T CCGCCCC CT C TGGAGGC TGAAGAGGGAT TC 
CAGCCCCTGCCACCCACAGACACGGGCTGACTGGGGTGTCTGCCCCCCTTGGGGGGGGGCAGCAC 
AGGGCCTCAGGCCTGGGTGCCACCTGGCACCTAGAAGATGCCTGTGCCCTGGTTCTTGCTGTCCT 
TGGCACTGGGCCGAAGCCCAGTGGTCCTTTCTCTGGAGAGGCTTGTGGGGCCTCAGGACGCTACC 
CACTGCTCTCCGGGCCTCTCCTGCCGCCTCTGGGACAGTGACATACTCTGCCTGCCTGGGGACAT 
CGTGCCTGCTCCGGGCCCCGTGCTGGCGCCTACGCACCTGCAGACAGAGCTGGTGCTGAGGTGCC 
AGAAGGAGACCGACTGTGACCTCTGTCTGCGTGTGGCTGTCCACTTGGCCGTGCATGGGCACTGG 
GAAGAGCC T GAAGAT GAGGAAAAGTT TGGAGGAGCAGC TGAC TCAGGGGTGGAGGAGCC TAGGAA 
TGCCTCTCTCCAGGCCCAAGTCGTGCTCTCCTTCCAGGCCTACCCTACTGCCCGCTGCGTCCTGC 
M TGGAGGTGCAAGTGCCTGCTGCCCTTGTGCAGTTTGGTCAGTCTGTGGGCTCTGTGGTATATGAC 
Q TGCTTCGAGGCTGCCCTAGGGAGTGAGGTACGAATCTGGTCCTATACTCAGCCCAGGTACGAGAA 
f*\ GGAACTCAACCACACACAGCAGCTGCCTGCCCTGCCCTGGCTCAACGTGTCAGCAGATGGTGACA 
ACGTGCATCTGGTTCTGAATGTCTCTGAGGAGCAGCACTTCGGCCTCTCCCTGTACTGGAATCAG 
GTCCAGGGCCCCCCAAAACCCCGGTGGCACAAAAACCTGACTGGACCGCAGATCATTACCTTGA^ 
|fl CCACACAGACCTGGTTCCCTGCCTCTGTATTCAGGTGTGGCCTCTGGAACCTGACTCCGTTAGGA 
|ll CGAACATCTGCCCCTTCAGGGAGGACCCCCGCGCACACCAGAACCTCTGGCAAGCCGCCCGACTG 
Jft CGACTGCTGACCCTGCAGAGCTGGCTGCTGGACGCACCGTGCTCGCTGCCCGCAGAAGCGGCACT 
IZ GTGCTGGCGGGCTCCGGGTGGGGACCCCTGCCAGCCACTGGTCCCACCGCTTTCCTGGGAGAACG 
r ? 1 TCACTGTGGACAAGGTTCTCGAGTTCCCATTGCTGAAAGGCCACCCTAACCTCTGTGTTCAGGTG 
s AACAGCTCGGAGAAGCTGCAGCTGCAGGAGTGCTTGTGGGCTGACTCCCTGGGGCCTCTCAAAGA 
Q CGATGTGCTACTGTTGGAGACACGAGGCCCCCAGGACAACAGATCCCTCTGTGCCTTGGAACCCA 
||| GTGGCTGTACTTCACTACCCAGCAAAGCCTCCACGAGGGCAGCTCGCCTTGGAGAGTACTTACTA 
J£ CAAGACCTGCAGTCAGGCCAGTGTCTGCAGCTATGGGACGATGACTTGGGAGCGCTATGGGCCTG 
J*** CCCCATGGACAAATACATCCACAAGCGCTGGGCCCTCGTGTGGCTGGCCTGCCTACTCTTTGCCG 
Iff CTGCGCTTTCCCTCATCCTCCTTCTCAAAAAGGATCACGCGAAAGGGTGGCTGAGGCTCTTGAAA 
Q CAGGACGTCCGCTCGGGGGCGGCCGCCAGGGGCCGCGCGGCTCTGCTCCTCTACTCAGCCGATGA 
f| CTCGGGTTTCGAGCGCCTGGTGGGCGCCCTGGCGTCGGCCCTGTGCCAGCTGCCGCTGCGCGTGG 
CCGTAGACCTGTGGAGCCGTCGTGAACTGAGCGCGCAGGGGCCCGTGGCTTGGTTTCACGCGCAG 
CGGCGCCAGACCCTGCAGGAGGGCGGCGTGGTGGTCTTGCTCTTCTCTCCCGGTGCGGTGGCGCT 
GTGCAGCGAGTGGCTACAGGATGGGGTGTCCGGGCCCGGGGCGCACGGCCCGCACGACGCCTTCC 
GCGCCTCGCTCAGCTGCGTGCTGCCCGACTTCTTGCAGGGCCGGGCGCCCGGCAGCTACGTGGGG 
GCCTGCTTCGACAGGCTGCTCCACCCGGACGCCGTACCCGCCCTTTTCCGCACCGTGCCCGTCTT 
CACACTGCCCTCCCAACTGCCAGACTTCCTGGGGGCCCTGCAGCAGCCTCGCGCCCCGCGTTCCG 
GGCGGCTCCAAGAGAGAGCGGAGCAAGTGTCCCGGGCCCTTCAGCCAGCCCTGGATAGCTACTTC 
CATCCCCCGGGGACTCCCGCGCCGGGACGCGGGGTGGGACCAGGGGCGGGACCTGGGGCGGGGGA 
CGGGAC T TAAATAAAGGCAGACGC TGT T TT TCTAAAAAAA 
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FIGURE 162 



MPVPWFLLSLALGRSPWLSLERLVGPQDATHCSPGLSCRLWDSDILCLPGDIVPAPGPVLAPTHLQTELV 
L RCQ KE T DC DLC L RVAVH LAVH GHWE E P E DE EKFGGAADS GVE E PRNA S L QAQ WL S FQAY PTARC VLLE V 
Q VP AAL VQ FGQS VG S WY DC FE AALGS E VR IWSYTQPRYEKELNHTQQL PAL PWLN VS ADGDNVHL VLNVS 
EEQHFGLSLYWNQVQGPPKPRWHKNLTGPQIITLNHTDLVPCLCIQVWPLEPDSVRTNICPFREDPRAHQN 
LWQAARLRLLTLQSWLLDAPCSLPAEAALCWRAPGGDPCQPLVPPLSWENVTVDKVXEFPLLKGHPNLCVQ 
VNSSEKLQLQECLWADSLGPLKDDVLLLETRGPQDNRSLCALEPSGCTSLPSKASTRAARLGEYLLQDLQS 
GQCLQLWDDDLGALWACPMDKY IHKRWALVWLACLL FAAALSLILLLKKDHAKGWLRLLKQDVRSGAAARG 
RAALLLYSADDSGFERLVGALASALCQLPLRVAVDLWSRRELSAQGPVAWFHAQRRQTLQEGGWVLLFSP 
GAVALCSEWLQDGVSGPGAHGPHDAFRASLSCVLPDFLQGRAPGSYVGACFDRLLHPDAVPALFRTVPVFT 
LPSQLPDFLGALQQPRAPRSGRLQERAEQVSRALQPALDSYFHPPGTPAPGRGVGPGAGPGAGDGT 

Signal sequence: 

amino acids 1-20 

Transmembrane domain. 

amino acids 453-475 

N-glycosylation sites. 

amino acids 118-121, 186-189, 198-201, 211-214, 238-241, 248-251, 
334-337, 357-360, 391-394 

Glycosaminoglycan attachment site. 

amino acids 583-586 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 552-555 

N-myristoylation sites. 

amino acids 107-112, 152-157, 319-324, 438-443, 516-521, 612-617, 
692-697, 696-701, 700-705 
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FIGURE 163 



ri 



GGGAGGGCTCTGTGCCAGCCCCG ATGA GGACGCTGCTGACCATCTTGACTGTGGGATCCCTGGCT 
GCTCACGCCCCTGAGGACCCCTCGGATCTGCTCCAGCACGTGAAATTCCAGTCCAGCAACTTTGA 
AAACATCC T GACGTGGGACAGCGGGCCAGAGGGCACCCCAGACACGGT CTACAGCAT CGAGTATA 
AGACGTACGGAGAGAGGGACTGGGTGGCAAAGAAGGGCTGTCAGCGGATCACCCGGAAGTCCTGC 
AACCTGACGGTGGAGACGGGCAACCTCACGGAGCTCTACTATGCCAGGGTCACCGCT 
GTCAGTGCGGGAGGCCGGTCAGCCACCAAGATGACTGACAGGTTCAGCTCTCTGCAGCACACTAC 
CCTCAAGCCACCTGATGTGACCTGTATCTCCAAAGTGAGATCGATTCAGATGATTGTTCATCCTA 
CCCCCACGCCAATCCGTGCAGGCGATGGCCACCGGCT71ACCCTGGAAGACATCTTCCATGACCTG 
T T CTACCACT TAGAGC TCCAGGTCAACCGCACCTACCAAAT GCACC T TGGAGGGAAGCAGAGAGA 
ATATGAGTTCTTCGGCCTGACCCCTGACACAGAGTTCCTTGGCACCATCATGATTTGCGTTCCCA 
M CC TGGGCCAAGGAGAGTGCCCCC TACATGT GC CGAGT GAAGACAC TGCCAGACCGGACAT GGACC 

PJ TACTCCTTCTCCGGAGCCTTCCTGTTCTCCATGGGCTTCCTCGTCGCAGTACTCTGCTACCTGAG 
m CTACAGATATGTCACCAAGCCGCCTGCACCTCCCAACTCCCTGAACGTCCAGCGAGTCCTGACTT 
% TCCAGCCGCTGCGCTTCATCCAGGAGCACGTCCTGATCCCTGTCTTTGACCTCAGCGGCCCCAGC 
AGTCTGGCCCAGCCTGTCCAGTACTCCCAGATCAGGGTGTCTGGACCCAGGGAGCCCGCAGGAGC 
III TCCACAGCGGCATAGCCTGTCCGAGATCACC T AC T TAGGGCAGCCAGACATC TCGAT CC T CCAGC 

|B CCTCCAACGTGCCACCTCCCCAGATCCTCTCCCCACTGTCCTATGCCCCAAACGCTGCCCCTGAG 
-J\ GTCGGGCCCCCATCCTATGCACCTCAGGTGACCCCCGAAGCTCAATTCCCATTCTACGCCCCACA 
GGCCATCTCTAAGGTCCAGCCTTCCTCCTATGCCCCTCAAGCCACTCCGGACAGCTGGCCTCCCT 
CCTATGGGGTATGCATGGAAGGTTCTGGCAAAGACTCCCCCACTGGGACACTTTCTAGTCCTAAA 
* CACCT TAGGCCT AAAGGTCAGC T T CAGAAAGAGCCACCAGC T GGAAGC TGCAT G T TAGGTGGCCT 

IJl TTCTCTGCAGGAGGTGACCTCCTTGGCTATGGAGGAATCCCAAGAAGCAAAATCATTGCACCAGC 
rll CCC TGGGGAT T T GCACAGACAGAACAT CTGACCCAAAT GTGCTACACAGTGGGGAGGAAGGGACA 

|Sj CCACAGTACCTAAAGGGCCAGCTCCCCCTCCTCTCCTCAGTCCAGATCGAGGGCCACCCCATGTC 
**; CCTCCCTTTGCAACCTCCTTCCGGTCCATGTTCCCCCTCGGACCAAGGTCCAAGTCCCTGGGGCC 
W TGCTGGAGTCCCTTGTGTGTCCCAAGGATGAAGCCAAGAGCCCAGCCCCTGAGACCTCAGACCTG 
Q GAGCAGCCCACAGAACTGGATTCTCTTTTCAGAGGCCTGGCCCTGACTGTGCAGTGGGAGTCCTG 
fjff AGGGGAATGGGAAAGGCTTGGTGCTTCCTCCCTGTCCCTACCCAGTGTCACATCCTTGGCTGTCA 
ATCCCATGCCTGCCCATGCCACACACTCTGCGATCTGGCCTCAGACGGGTGCCCTTGAGAGAAGC 
AGAGGGAGTGGCATGCAGGGCCCCTGCCATGGGTGCGCTCCTCACCGGAACAAAGCAGCATGATA 
AGGACTGCAGCGGGGGAGCTCTGGGGAGCAGCTTGTGTAGACAAGCGCGTGCTCGCTGAGCCCTG 
CAAGGCAGAAATGACAGT GCAAGGAGGAAAT GCAGGGAAAC TCCCGAGGT CCAGAGCCCCACCT C 
CTAACACCATGGATTCAAAGTGCTCAGGGAATTTGCCTCTCCTTGCCCCATTCCTGGCCAGTTTC 
ACAATCTAGCTCGACAGAGCATGAGGCCCCTGCCTCTTCTGTCATTGTTCAAAGGTGGGAAGAGA 
GCCTGGAAAAGAACCAGGCC T GGAAAAGAACCAGAAGGAGGC T GGGCAGAACCAGAACAACCT GC 
ACTTCTGCCAAGGCCAGGGCCAGCAGGACGGCAGGACTCTAGGGAGGGGTGTGGCCTGCAGCTCA 
TTCCCAGCCAGGGCAACTGCCTGACGTTGCACGATTTCAGCTTCATTCCTCTGATAGAACAAAGC 
GAAATGCAGGTCCACCAGGGAGGGAGACACACAAGCCTT TT CTGCAGGCAGGAGTT TCAGACCCT 
ATCCTGAGAATGGGGTTTGAAAGGAAGGTGAGGGCTGTGGCCCCTGGACGGGTACAATAACACAC 
TGTACTGATGTCACAACTTTGCAAGCTCTGCCTTGGGTTCAGCCCATCTGGGCTCAAATTCCAGC 
CTCACCACTCACT^AGCTGTGTGACTTCAAACAAATGAAATCAGTGCCCAGAACCTCGGTTTCCTC 
ATCTGTAATGTGGGGATCATAACACCTACCTCATGGAGTTGTGGTGAAGATGAAATGAAGTCATG 
TCTTTAAAGTGCTTAATAGTGCCTGGTACATGGGCAGTGCCCAATAAACGGTAGCTATTTAAAAA 
AAAAAAAA 
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FIGURE 16 J. 

MRTLLTILTVGSLAAHAPEDPSDLLQHVKFQSSNFENILTWDSGPEGTPDTVTSIEYKTYGERDW 
VAKKGCQRITRKSCNLTVETGNLTELYYARVTAVSAGGRSATKMTDRFSSLQHTTLKPPDVTCIS 
KVRS I QMI VHPT PT P I RAGDGHRL TLE D I FHDL FYHLE LQVNRT YQMHLGGKQRE YE FFGLT PDT 
EFLGTIMICVPTWAKESAPYMCRVKTLPDRTWTYSFSGAFLFSMGFLVAVLCYLSYRYVTKPPAP 
PNSLNVQRVLTFQPLRFIQEHVLIPVFDLSGPSSLAQPVQYSQIRVSGPREPAGAPQRHSLSEIT 
YLGQPDISILQPSNVPPPQILSPLSYAPNAAPEVGPPSYAPQVTPEAQFPFYAPQAISKVQPSSY 
APQAT PDSWPPS YGVCMEGS GKDS PTGTLS S PKHLRPKGQLQKE PPAGS CMLGGLSLQEVTS LAM 
EESQEAKSLHQPLGICTDRTSDPNVLHSGEEGTPQYLKGQLPLLSSVQIEGHPMSLPLQPPSGPC 
SPSDQGPSPWGLLESLVCPKDEAKSPAPETSDLEQPTELDSLFRGLALTVQWES 

Signal sequence. 

amino acids 1-17 

Transmembrane domain. 

amino acids 233-250 

N-glycosylation sites . 

amino acids 80-83, 87-90, 172-175 

N-myristoylation sites. 

amino acids 11-16, 47-52, 102-107, 531-536, 565-570 
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FIGURE 165 



TGGCCTACTGGAAAAAAAAAAAAAAAAAAAAAAAGTCACCCGGGCCCGCGGTGGCCACAACATGG 
CTGCGGCGCCGGGGCTGCTCTTCTGGCTGTTCGTGCTGGGGGCGCTCTGGTGGGTCCCGGGCCAG 
TCGGATCTCAGCCACGGACGGCGTTTCTCGGACCTCAAAGTGTGCGGGGACGAAGAGTGCAGCAT 
GTTAATGTACCGTGGGAAAGCTCTTGAAGACTTCACGGGCCCTGATTGTCGTTTTGTGAATTTTA 
AAAAAGGTGACGATGTATATGTCTACTACAAACTGGCAGGGGGATCCCTTGAACTTTGGGCTGGA 
AGTGT TGAACACAGT T T T GGATAT T T TCCAAAAGAT TT GATCAAGGTAC T T CATAAAT ACACGGA 
AGAAGAGCTACATATT CCAGCAGATGAGACAGACTT TGTCTGCTT TGAAGGAGGAAGAGATGATT 
T T AATAG T TATAAT G T AGAAGAGC T T T T AGGAT C T T T G GAAC T G GAG GAC T C T G TACC T GAAGAG 
fh TCGAAGAAAGC T GAAGAAGT TT C TCAGCACAGAGAGAAATC TCC T GAGGAGT CT CGGGGGCGTGA 

|l! ACT TGACCC TGT GCC T GAGCCCGAGGCAT TCAGAGCTGAT T CAGAGGATGGAGAAGGT GCTT TCT 

CAGAGAGCACCGAGGGGCT GCAGGGACAGCCCT CAGCT CAGGAGAGCCACCCTCACACCAGCGGT 
CCTGCGGCTAACGCTCAGGGAGTGCAGTCTTCGTTGGACACTTTTGAAGAAATTCTGCACGAT7VA 
AT TGAAAGTGCCGGGAAGCGAAAGCAGAAC TGGCAATAGT TCTCC TGCCTCGGTGGAGCGGGAGA 
AGACAGATGCTT ACAAAGT C CT GAAAACAGAAAT GAGT CAGAGAGGAAG TGGACAGTGCGTT AT T 
CAT T AC AG CAAA.GGAT T T C G T T GGCAT C AAAAT C T AAG T T T G T T T TACAAAGAT T G T T T T TAGT A 
C T AAGC T GC C T T GG C AG T T T G CAT T T T T GAGC C AAAC AAAAAT AT AT TAT TTTCCCTTC T AAG T A 
AAAAAAAAAAAAAAAAAAAA 
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FIGURE 166 



MAAAPGLLFWLFVLGALWWVPGQSDLSHGRRFSDLKVCGDEECSMLMYRGKALEDFTGPDCRFVN 
FKKGDDVYVYYKLAGGSLELWAGSVEHSFGYFPKDLIKVLHKYTEEELHIPADETDFVCFEGGRD 
DFNSYNVEELLGSLELEDSVPEESKKAEEVSQHREKSPEESRGRELDPVPEPEAFRADSEDGEGA 
FS E S TEGLQGQPSAQESHPHT S GPAANAQGVQS S LDT FEE I LHDKLKVPGSE SRTGNS S PAS VER 
EKT DAYKVLKT EMS QRGS GQC VI H YS KG FRWHQNL S L FYKDC F 

Important features of the protein: 
Signal peptide: 

amino acids 1-22 

N-glycosylation site. 

amino acids 294-298 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 30-34 

Tyrosine kinase phosphorylation site. 

amino acids 67-7 6 

N-myristoylation sites. 

amino acids 205-211, 225-231, 277-283 

Amidation site. 

amino acids 28-32 
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FIGURE 167 



CCAGGACCAGGGCGCACCGGCTCAGCCTCTCACTTGTCAGAGGCCGGGGAAGAGAAGCAAAGCGC 
AACGGTGTGGTCCAAGCCGGGGCTTCTGCTTCGCCTCTAGGACATACACGGGACCCCGTAACTTC 
AGTCCCCCAAACGCGCACCCTCGAAGTCTTGAACTCCAGCCCCGCACATCCACGCGCGGCACAGG 
CGCGGCAGGCGGCAGGTCCCGGCCGAAGGCGATGCGCGCAGGGGGTCGGGCAGCTGGGCTCGGGC 
GGCGGGAGTAGGGCCCGGCAGGGAGGCAGGGAGGCTGCATATTCAGAGTCGCGGGCTGCGCCCTG 
GGCAGAGGCCGCCCTCGCTCCACGCAACACCTGCTGCTGCCACCGCGCCGCG ATGA GCCGCGTGG 
TCTCGCTGCTGCTGGGCGCCGCGCTGCTCTGCGGCCACGGAGCCTTCTGCCGCCGCGTGGTCAGC 
GGCCAAAAGGTGTGTTTTGCTGACTTCAAGCATCCCTGCTACAAAATGGCCTACTTCCATGAACT 
GTCCAGCCGAGTGAGCTTTCAGGAGGCACGCCTGGCTTGTGAGAGTGAGGGAGGAGTCCTCCTCA 
GCC T T GAGAATGAAGCAGAACAGAAGT TAATAGAGAGCAT GT TGCAAAACCTGACAAAACCCGGG 
ACAGGGAT T TCT GATGGT GAT TT CT GGATAGGGCT T TGGAGGAATGGAGATGGGCAAACAT CTGG 
C3 TGCCTGCCCAGATCTCTACCAGTGGTCTGATGGAAGCAATTCCCAGTACCGAAACTGGTACACAG 
O ATGAACCTTCCTGCGGAAGTGAAAAGTGTGTTGTGATGTATCACCAACCAACTGCCAATCCTGGC 
fft CT T GGGGGT CCCT ACCT T TACCAGTGGAATGAT GACAGGTGTAACATGAAGCACAAT TAT ATT T G 

f„l CAAGTAT GAACCAGAGAT TAATCCAACAGCCCCTGTAGAAAAGCC T TATC TTACAAAT CAACCAG 

r|; GAGACAC CCAT CAGAAT GTGGTTGT TAC TGAAGCAGGTATAAT TCCCAAT CTAATT TAT G T TGT T 

III ATACCAACAATACCCCTGCTCTTACTGATACTGGTTGCTTTTGGAACCTGTTGTTTCCAGATGCT 
*JJ GCATAAAAG TAAAGGAAGAACAAAAAC TAGTCCAAACCAGT C TACACTGT GGAT T T CAAAGAGTA 

j|i CCAGAAAAQAAAGT GGCATGGAAGTATAATAAC TCAT T GACT T GGT TC CAGAAT TT TGTAAT TCT 

GGATCTGTATAAGGAATGGCATCAGAACAATAGCT TGGAATGGCT TGAAATCACAAAGGATCTGC 
AAGAT GAAC TGTAAGCT CCC CC T T GAGGCAAAT AT TAAAGT AATT T T TATATGT CT AT TAT T T CA 
W TTTAAAGAATATGCTGTGCTAATAATGGAGTGAGACATGCTTATTTTGCTAAAGGATGCACCCAA 
|H ACT TCAAACT TCAAGCAAAT GAAATGGACAATGCAGATAAAGTTGTTATCAACACGTCGGGAGTA 

fh TGTGTGTTAGAAGCAATTCCTTTTATTTCTTTCACCTTTCATAAGTTGTTATCTAGTCAATGTAA 
H TGTATAT TGTAT T GAAAT TTACAGTGT GCAAAAG TATT T TACC TT T GCATAAGT GT T TGATAAAA 

JJJ AT GAAC T G T T C T AATAT T TAT T T T TAT GGCATC T CAT T T T TC AAT ACATGC T C T T T T GAT T AAAG 

W AAACT TATTACTGT TGTCAACTGAATTCACACACACACAAATATAGTACCATAGAAAAAGTTTGT 

f|J TTTCTCGAAATAATTCATCTTTCAGCTTCTCTGCTTTTGGTCAATGTCTAGGAAATCTCTTCAGA 
AATAAGAAGCTATT TCAT TAAGTGTGATATAAACCTCCTCAAACATTTTACT TAGAGGCAAGGAT 
TGTCTAATTTCAAT TGTGCAAGACATGTGCCTTATAAT TATTT TTAGCT TAAAATTAAACAGAT T 
T TG TAATAATGT AACT T TGT TAATAGGTGCATAAACAC TAAT GCAGTCAAT TT GAACAAAAGAAG 
TGACATACACAATATAAATCATATGTCTTCACACGTTGCCTATATAATGAGAAGCAGCTCTCTGA 
GGGTTCTGAAATCAATGTGGTCCCTCTCTTGCCCACTAAACAAAGATGGTTGTTCGGGGTTTGGG 
ATTGACACTGGAGGCAGATAGTTGCAAAGTTAGTCTAAGGTTTCCCTAGCTGTATTTAGCCTCTG 
AC TATAT TAGTATACAAAGAGGTCATG TGGT T GAGACCAGGT GAATAGTCACTATCAGT G T GGAG 
ACAAGCACAGCACACAGACATTT TAGGAAGGAAAGGAACTACGAAATCGTGTGAAAATGGGTT GG 
AACCCATCAGTGATCGCATATTCATTGATGAGGGTTTGCTTGAGATAGAAAATGGTGGCTCCTTT 
CTGTCTTATCTCCTAGTTTCTTCAATGCTTACGCCTTGTTCTTCTCAAGAGAAAGTTGTAACTCT 
CTGGTCTTCATATGTCCCTGTGCTCCTTTTAACCAAATAAAGAGTTCTTGTTTCTGGGGGAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 168 

MSRWSLLLGAALLCGHGAFCRRWSGQKVCFADFKHPCYKMAYFHELSSRVSFQEARLACESE 
GGVLLSLENEAEQKLIESMLQNLTKPGTGISDGDFWIGLWRNGDGQTSGACPDLYQWSDGSNSQ 
YRNWYTDEPSCGSEKCVVMYHQPTANPGLGGPYLYQWNDDRCNMKHNYICKYEPEINPTAPVEK 
PYLTNQPGDTHQNVWTEAGIIPNLIYWIPTIPLLLLILVAFGTCCFQMLHKSKGRTKTSPNQ 
STLWISKSTRKESGMEV 



Important features of the protein: 
Signal peptide: 

amino acids 1-21 



w 

|*| Transmembrane domain: 

amino acids 214-235 

. f% N-glycosylation sites . 

f|| amino acids 86-89, 255-258 

M cAMP- and cGMP-dependent protein kinase phosphorylation site. 

m 

:J; amino acids 2 66-269 

[j N-myristoylation sites. 

Ill amino acids 27-32, 66-71, 91-96, 93-98, 102-107, 109-114, 140- 

145, 212-217 
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